Skip to main content

Advertisement

Log in

Investigating adoption patterns of residential low impact development (LID) using classification trees

  • Published:
Environment Systems and Decisions Aims and scope Submit manuscript

Abstract

Local governments are under pressure to improve storm water management and often times must comply with consent decrees with the Federal Government. Decentralizing a portion of the storm water management by integrating private landowners into localized retention and infiltration efforts, that is, low impact development (LID) or green infrastructure projects, is becoming increasingly popular. Some wastewater systems have considered incentivizing private land owners to make improvements aimed at retaining storm water or slowing the conveyance to grey infrastructure. This study examines potential opportunities for incentivizing private residential land owners in Washington DC to install LID projects. This study maps LID configurations to a set of adoption strategies and categories. The C4.5 algorithm is then applied to identify a high performance decision tree for classifying parcels by adoption strategy or adoption categories based on property-level attributes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

Download references

Acknowledgements

The authors acknowledge the United States National Science Foundation (NSF RIPS Project No. 1441226) for financial support of this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Domenico C. Amodeo.

Appendix A

Appendix A

Decision trees are methods developed for classifying observations based on both continuous and discrete attributes. Decision trees partition the data into classes based on a series of tests. Each test results in a branching of the data into either a terminal classification node, or additional test sequence. While numerous decision tree algorithms exist, the authors applied the C4.5 algorithm, developed by J.R. Quinlan (1993). The C4.5 algorithm was implemented in R using the Rweka package. Equation (1) through Eq. (4) are adopted from Quinlan (1993).

The C4.5 algorithm creates a decision tree by developing rules based on “information gain”. This information gain is calculated by first calculating the entropy of the test set using the following Eq. (1):

$${\text{Entrop}}{{\text{y}}_S}= - \mathop \sum \limits_{{j=1}}^{n} \frac{{{\text{freq}}({C_j},S)}}{{\left| S \right|}} \times {\log _2}\left( {\frac{{{\text{freq}}({C_j},S)}}{{\left| S \right|}}} \right)$$
(1)

Next the portion of the entropy attributable to each variable is calculated as in Eq. (2):

$$~{\text{Entrop}}{{\text{y}}_X}= - \mathop \sum \limits_{{i=1}}^{n} \frac{{\left| {{T_i}} \right|}}{{\left| T \right|}} \times {\text{Entrop}}{{\text{y}}_T}$$
(2)

The gain from each variable, X, is calculated as the difference between the result for Eq. 1 for the whole training set and Eq. (2) for each variable.

$${\text{Gai}}{{\text{n}}_X}={\text{Entrop}}{{\text{y}}_T} - ~{\text{Entrop}}{{\text{y}}_X}$$
(3)

The “gain ratio” is calculated by dividing the entropy of the training set (Eq. 1), by the gain associated with each variable as in Eq. (4). It is this criterion that the j48 implementation uses to determine the splitting rules.

$${\text{Gain~Rati}}{{\text{o}}_X}=\frac{{{\text{Gai}}{{\text{n}}_X}}}{{~{\text{Entrop}}{{\text{y}}_X}}}$$
(3)

where X is the variable being assessed and T is the multi-class data previously partitioned in Eq. (3). The gain for each variable is measured as the difference between Eq. (1) applied to a test set and the weighted entropy for each dependent variable. For example, a model with five dependent variables would entail calculating five weighted entropies.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Amodeo, D.C., Francis, R.A. Investigating adoption patterns of residential low impact development (LID) using classification trees. Environ Syst Decis 39, 295–306 (2019). https://doi.org/10.1007/s10669-019-09725-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10669-019-09725-3

Keywords

Navigation