Relationships Between Average Depth and Number of Nodes for Decision Trees

Chikalov, Igor; Hussain, Shahid; Moshkov, Mikhail

doi:10.1007/978-3-642-37832-4_47

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 214)

Abstract

This paper presents a new tool for studying the relationships between total path length (or average depth) and the number of nodes of decision trees. In addition to the algorithm, the paper presents the results of experiments with datasets from the UCI ML Repository [1].

References

[1] Frank A, Asuncion A (2010) UCI Machine Learning Repository
[2] Alkhalid A, Chikalov I, Moshkov M (2010) On algorithm for building of optimal $\alpha$-decision trees. In: Szczuka MS, Kryszkiewicz M, Ramanna S, Jensen R, Hu Q (eds) RSCTC. Springer, Heidelberg, pp 438–445
[3] Alkhalid A, Chikalov I, Moshkov M (2010) A tool for study of optimal decision trees. In: Yu J, Greco S, Lingras P, Wang G, Skowron A (eds) RSKT. LNCS, vol 6401. Springer, Heidelberg, pp 353–360
[4] Alkhalid A, Chikalov I, Hussain S, Moshkov M (2012) Extensions of dynamic programming as a new tool for decision tree optimization. SIST, vol 13. Springer, Heidelberg, pp 16–36
[5] Alkhalid A, Amin T, Chikalov I, Hussain S, Moshkov M, Zielosko B (2011) Dagger: a tool for analysis and optimization of decision trees and rules. In: Ficarra FVC (ed) Computational informatics, social factors and new information technologies: hypermedia perspectives and avant-garde experiences in the era of communicability expansion. Blue Herons, Bergamo, pp 29–39
[6] Chikalov I, Hussain S, Moshkov M (2011) Relationships between depth and number of misclassifications for decision trees. In: Kuznetsov SO, Slezak D, Hepting DH, Mirkin B (eds) Thirteenth international conference on rough sets, fuzzy sets, data mining and granular computing (RSFDGrC 2011). LNCS, vol 6743. Springer, Heidelberg, pp 286–292
[7] Pawlak Z (1991) Theoretical aspects of reasoning about data. Kluwer Academic Publishers, Dordrecht
[8] Skowron A, Rauszer C (1992) The discernibility matrices and functions in information systems. In: Slowinski R (ed) Intelligent decision support. Handbook of applications and advances of the rough set theory. Kluwer Academic Publishers, Dordrecht, pp 331–362
[9] Nguyen HS (1998) From optimal hyperplanes to optimal decision trees. Fundam Inf 34(1–2):145–174

Author information

Authors and Affiliations

Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
Igor Chikalov, Shahid Hussain & Mikhail Moshkov

Corresponding author

Correspondence to Shahid Hussain.

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Tsinghua University, Beijing, China, People's Republic
Fuchun Sun
School of Information Science and Technology, Southwest Jiaotong University, Chengdu, China, People's Republic
Tianrui Li
Department of Computer Science and Techn, Tsinghua University, Beijing, China, People's Republic
Hongbo Li

Appendix: Transformation of Functions

Let $f$ and $g$ be two functions from a set $A$ onto $C_f$ and $C_g$ respectively, where $C_f$ and $C_g$ are finite sets of nonnegative integers. Let $B_f = \{m_f, m_f+1, \ldots , M_f\}$ and $B_g = \{n_g, n_g+1, \ldots , N_g\}$ where $m_f = \min \{m : m\in C_f\}$ and $n_g = \min \{n: n\in C_g\}$. Furthermore, $M_f$ and $N_g$ are natural numbers such that $m\le M_f$ and $n\le N_g$ for any $m\in C_f$ and $n\in C_g$, respectively.

We define two functions ${\mathcal{F}} :B_g \rightarrow B_f $ and ${\mathcal{G}} :B_f \rightarrow B_g$ as follows:

$${\mathcal{F}} (n) = \min \{f(a) : a\in A, g(a) \le n\},\; \forall n\in B_g, \tag{1}$$

$${\mathcal{G}}(m) = \min \{g(a) : a\in A, f(a) \le m\},\; \forall m\in B_f. \tag{2}$$

It is clear that both ${\mathcal{F}} $ and ${\mathcal{G}}$ are nonincreasing functions.
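
As an illustration of definitions (1) and (2), the following minimal Python sketch evaluates both functions by brute force on a small made-up set $A$. The pairs in `pairs`, the choice $M_f = \max C_f$, $N_g = \max C_g$, and the helper name `tabulate` are assumptions made only for this example; they do not come from the paper.

```python
def tabulate(pairs):
    """Tabulate F and G from definitions (1) and (2).

    `pairs` lists (f(a), g(a)) for every a in a finite set A.
    Here M_f and N_g are taken to be the maxima of f and g (an assumption;
    the text allows any upper bounds).
    """
    f_vals = [f for f, _ in pairs]
    g_vals = [g for _, g in pairs]
    m_f, M_f = min(f_vals), max(f_vals)
    n_g, N_g = min(g_vals), max(g_vals)
    # F(n) = min{ f(a) : g(a) <= n } for n in B_g
    F = {n: min(f for f, g in pairs if g <= n) for n in range(n_g, N_g + 1)}
    # G(m) = min{ g(a) : f(a) <= m } for m in B_f
    G = {m: min(g for f, g in pairs if f <= m) for m in range(m_f, M_f + 1)}
    return F, G


# Hypothetical data: each pair is (f(a), g(a)) for one element a of A.
pairs = [(2, 7), (3, 5), (5, 4), (6, 4)]
F, G = tabulate(pairs)
print(F)  # {4: 5, 5: 3, 6: 3, 7: 2}
print(G)  # {2: 7, 3: 5, 4: 5, 5: 4, 6: 4}
```

In this toy example both tables are nonincreasing in their argument, as claimed above.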

The following proposition states that each of the functions ${\mathcal{F}} $ and ${\mathcal{G}}$ determines the other: we can evaluate ${\mathcal{F}} $ using ${\mathcal{G}}$ and vice versa, so it is enough to know only one of them.

Proposition 3

For any $n\in B_g$,

$${\mathcal{F}} (n) = \min \{m\in B_f: {\mathcal{G}} (m) \le n\},$$

and for any $m\in B_f$,

$${\mathcal{G}} (m) = \min \{n\in B_g: {\mathcal{F}} (n) \le m\}.$$

Proof

Suppose that for some $n\in B_g$

$${\mathcal{F}} (n) = m_0. \tag{3}$$

Furthermore, let

$$\min \{m \in B_f: {\mathcal{G}} (m) \le n\} = t. \tag{4}$$

From (3) it follows that

(i) there exists $b\in A$ such that $g(b) \le n$ and $f(b) = m_0$;
(ii) for any $a\in A$, if $g(a) \le n$ then $f(a) \ge m_0$.

From (i) it follows that ${\mathcal{G}} (m_0) \le n$, which implies $t\le m_0$. Let us assume that $t< m_0$. In this case, there exists $m_1 < m_0$ (namely $m_1 = t$) for which ${\mathcal{G}} (m_1) \le n$. Therefore, there exists $a\in A$ such that $f(a) \le m_1$ and $g(a) \le n$; but from (ii) it follows that $f(a) \ge m_0 > m_1$, which is impossible. So $t=m_0$.

Similarly, we can prove the second part of the statement.

Proposition 3 allows us to transform the function ${\mathcal{G}}$ given by a tuple $\left( {\mathcal{G}} (m_f), {\mathcal{G}} (m_f+1), \ldots , {\mathcal{G}} (M_f)\right) $ into the function ${\mathcal{F}} $ and vice versa. Since ${\mathcal{G}} (m_f) \ge {\mathcal{G}} (m_f + 1) \ge \cdots \ge {\mathcal{G}} (M_f)$, we can find the minimum $m\in B_f$ such that ${\mathcal{G}} (m) \le n$ by binary search, which requires $O(\log |B_f|)$ comparisons of numbers. So to find the value ${\mathcal{F}} (n)$ for $n\in B_g$ it is enough to make $O(\log |B_f|)$ comparison operations.
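
The transformation can be sketched in Python as follows. This is only an illustration under stated assumptions: the tuple `G_values`, the value `m_f = 2`, and the function name `evaluate_F` are hypothetical and chosen for the example, not taken from the paper.

```python
def evaluate_F(G_values, m_f, n):
    """Return F(n) = min{ m in B_f : G(m) <= n } by binary search.

    `G_values` is the nonincreasing tuple (G(m_f), G(m_f + 1), ..., G(M_f)).
    Uses O(log |B_f|) comparisons.
    """
    lo, hi = 0, len(G_values) - 1
    if G_values[hi] > n:
        # Happens only when n is below n_g, i.e. n is not in B_g.
        raise ValueError("n is smaller than every value of G")
    # Invariant: G_values[hi] <= n, and every index below lo has G > n.
    while lo < hi:
        mid = (lo + hi) // 2
        if G_values[mid] <= n:
            hi = mid          # mid satisfies the condition; look further left
        else:
            lo = mid + 1      # mid fails; the answer lies strictly to the right
    return m_f + lo


# Hypothetical tuple (G(2), G(3), G(4), G(5), G(6)) with m_f = 2.
G_values = (7, 5, 5, 4, 4)
F_table = {n: evaluate_F(G_values, 2, n) for n in range(4, 8)}
print(F_table)  # {4: 5, 5: 3, 6: 3, 7: 2}
```

A hand-written loop is used because the tuple is nonincreasing rather than sorted in ascending order; swapping the roles of ${\mathcal{F}} $ and ${\mathcal{G}}$ gives the reverse transformation.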

About this paper

Cite this paper

Chikalov, I., Hussain, S., Moshkov, M. (2014). Relationships Between Average Depth and Number of Nodes for Decision Trees. In: Sun, F., Li, T., Li, H. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37832-4_47

DOI: https://doi.org/10.1007/978-3-642-37832-4_47
Published: 24 July 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37831-7
Online ISBN: 978-3-642-37832-4
eBook Packages: Engineering, Engineering (R0)
