Skip to main content

Relationships Between Average Depth and Number of Nodes for Decision Trees

  • Conference paper
  • First Online:
Knowledge Engineering and Management

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 214))

Abstract

This paper presents a new tool for the study of relationships between total path length or average depth and number of nodes of decision trees. In addition to algorithm, the paper also presents the results of experiments with datasets from UCI ML Repository [1].

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Frank A, Asuncion A (2010) UCI Machine Learning Repository

    Google Scholar 

  2. Alkhalid A, Chikalov I, Moshkov M (2010) On algorithm for building of optimal \(\alpha \)-decision trees. In: Szczuka MS, Kryszkiewicz M, Ramanna S, Jensen R, Hu Q (eds) RSCTC. Springer, Heidelberg, pp 438–445

    Google Scholar 

  3. Alkhalid A, Chikalov I, Moshkov M (2010) A tool for study of optimal decision trees. In: Yu J, Greco S, Lingras P, Wang G, Skowron A (eds) RSKT. LNCS, vol 6401. Springer, Heidelberg, pp 353–360

    Google Scholar 

  4. Alkhalid A, Chikalov I, Hussain S, Moshkov M (2012) In: Extensions of dynamic programming as a new tool for decision tree optimization. SIST, vol 13. Springer, Heidelberg, pp 16–36

    Google Scholar 

  5. Alkhalid A, Amin T, Chikalov I, Hussain S, Moshkov M, Zielosko B (2011) Dagger: a tool for analysis and optimization of decision trees and rules. In: Francisco V. C. Ficarra (ed) Computational informatics, social factors and new information technologies: hypermedia perspectives and avant-garde experiencies in the Era of communicability expansion. Blue Herons, Bergamo, pp 29–39

    Google Scholar 

  6. Chikalov I, Hussain S, Moshkov M (2011) Relationships between depth and number of missclassifications for decision trees. In: Kuznetsov SO, Slezak D, Hepting DH, Mirkin B (eds) Thirteenth international conference on rough sets, fuzzy sets, data mining and granualr computing (RSFDGrC 2011). LNCS, vol 6743. Springer, Heidelberg, pp 286–292

    Google Scholar 

  7. Pawlak Z (1991) Theoretical aspects of reasoning about data. Kluwer Academic Publishers, Dordrecht

    Google Scholar 

  8. Skowron A, Rauszer C (1992) The discernibility matrices and functions in information systems. In: Slowinski R (ed) Intelligent decision support. Handbook of applications and advances of the rough set theory. Kluwer Academic Publishers, Dordrecht, pp 331–362

    Google Scholar 

  9. Nguyen HS (1998) From optimal hyperplanes to optimal decision trees. Fundam Inf 34(1–2):145–174

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shahid Hussain .

Editor information

Editors and Affiliations

Appendix: Transformation of Functions

Appendix: Transformation of Functions

Let \(f\) and \(g\) be two functions from a set \(A\) onto \(C_f\) and \(C_g\) respectively, where \(C_f\) and \(C_g\) are finite sets of nonnegative integers. Let \(B_f = \{m_f, m_f+1, \ldots , M_f\}\) and \(B_g = \{n_g, n_g+1, \ldots , N_g\}\) where \(m_f = \min \{m : m\in C_f\}\) and \(n_g = \min \{n: n\in C_g\}\). Furthermore, \(M_f\) and \(N_g\) are natural numbers such that \(m\le M_f\) and \(n\le N_g\) for any \(m\in C_f\) and \(n\in C_g\), respectively.

We define two functions \({\mathcal{F}} :B_g \rightarrow B_f \) and \({\mathcal{G}} :B_f \rightarrow B_g\) as follows:

$${\mathcal{F}} (n) = \min \{f(a) : a\in A, g(a) \le n\},\; \forall n\in B_g,$$
(1)
$${\mathcal{G}}(m) = \min \{g(a) : a\in A, f(a) \le m\},\; \forall m\in B_f.$$
(2)

It is clear that both \({\mathcal{F}} \) and \({\mathcal{G}}\) are nonincreasing functions.

The following proposition states that the functions \({\mathcal{F}} \) and \({\mathcal{G}}\) can be used interchangeably and we can evaluate \({\mathcal{F}} \) using \({\mathcal{G}}\) and vice versa, i.e., it is enough to know only one function to evaluate the other.

Proposition 3

For any \(n\in B_g\),

$${\mathcal{F}} (n) = \min \{m\in B_f: {\mathcal{G}} (m) \le n\},$$

and for any \(m\in B_f\),

$${\mathcal{G}} (m) = \min \{n\in B_g: {\mathcal{F}} (n) \le m\}.$$

Proof

Let for some \(n\in B_g\)

$${\mathcal{F}} (n) = m_0.$$
(3)

Furthermore, we assume that

$$\min \{m \in B_f: {\mathcal{G}} (m) \le n\} = t.$$
(4)

From (3) it follows that

  1. (i)

    there exists \(b\in A\) such that \(g(b) \le n\) and \(f(b) = m_0\);

  2. (ii)

    for any \(a\in A\) if \(g(a) \le n\) then \(f(a) \ge m_0\).

From (i) it follows that \({\mathcal{G}} (m_0) \le n\). This implies \(t\le m_0\). Let us assume that \(t< m_0\). In this case, there exits \(m_1<\,m_0\) for which \({\mathcal{G}} (m_1) \le n\). Therefore, there exists \(a\in A\) such that \(f(a) \le m_1\) and \(g(a) \le n\), but from (ii) it follows that \(f(a) \ge m_0\), which is impossible. So \(t=m_0\).

Similarly, we can prove the second part of the statement.

Proposition 3 allows us to transform the function \({\mathcal{G}}\) given by a tuple \(\left( {\mathcal{G}} (m_f), {\mathcal{G}} (m_f+1), \ldots , {\mathcal{G}} (M_f)\right) \) into the function \({\mathcal{F}} \) and vice versa. We know that \({\mathcal{G}} (m_f) \ge {\mathcal{G}} (m_f + 1) \ge \cdots \ge {\mathcal{G}} (M_f)\), to find the minimum \(m\in B_f\) such that \({\mathcal{G}} (m) \le m\) we can use binary search which requires \(O(\log |B_f|)\) comparisons of numbers. So to find the value \({\mathcal{F}} (n)\) for \(n\in B_g\) it is enough to make \(O(\log |B_f|)\) operations of comparison.

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chikalov, I., Hussain, S., Moshkov, M. (2014). Relationships Between Average Depth and Number of Nodes for Decision Trees. In: Sun, F., Li, T., Li, H. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37832-4_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37832-4_47

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37831-7

  • Online ISBN: 978-3-642-37832-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics