Skip to main content
Log in

Examining the Information Content of Residuals from Hedonic and Spatial Models Using Trees and Forests

  • Published:
The Journal of Real Estate Finance and Economics Aims and scope Submit manuscript

Abstract

Machine learning algorithms such as neural nets, support vector machines, and tree-based techniques (classification and regression trees) have shown great success in dealing with a number of complex problems (Hastie et al. 2009). However, real estate data exhibit both temporal dependence and high levels of spatial dependence (Pace et al., International Journal of Forecasting16(2), 229–246, 2000; LeSage and Pace 2009) that may make it harder to use with off-the-shelf machine learning procedures. We examine tree-based techniques (CART, boosting, and bagging) and compare these to spatiotemporal methods. We find that bagging works well and can give lower ex-sample residuals than global spatiotemporal methods, but do not perform better than local spatiotemporal methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. See Borst (1992), Quang Do and Grudnitski (1993), Borst and McCluskey (1997), and Nguyen and Cripps (2001) for some earlier work. Chiarazzo et al. (2014) represent some more recent work in this area.

  2. These include Grand Prairie, Irving, Plano, Farmers branch, Garland, Mesquite, Highland Park, University Park, Euless, and Grapevine.

  3. Pace et al. (2012) show that dependence in the explanatory variables can make an impact in several scenarios and can account for the aberrant estimates from IV estimators that sometimes occurs in practice.

  4. The actual observations were fewer than these amounts because of filters applied such as requiring 500 or more square feet as described in the text associated with Table 1.

References

  • Bogin, A.N., & Shui, J. (2018). Appraisal accuracy, automated valuation models, and credit modeling in rural areas, Working Paper 18-03, Federal Housing Finance Agency.

  • Borst, R.A. (1992). Artificial neural networks: The next modelling/calibration technology for the assessment community. Artificial Neural Networks, 64–94.

  • Borst, R.A., & McCluskey, W.J. (1997). An evaluation of MRA, comparable sales analysis and ANNs for the mass appraisal of residential properties in Northern Ireland. Assessment Journal, 4(1), 47–55.

    Article  Google Scholar 

  • Brunsdon, C., Fotheringham, A.S., Charlton, M.E. (1996). Geographically weighted regression: A method for exploring spatial non-stationarity. Geographical Analysis, 28, 281–298.

    Article  Google Scholar 

  • Chiarazzo, V., Caggiani, L., Marinelli, M., Ottomanelli, M. (2014). A neural network based model for real estate price estimation considering environmental quality of property location. Transportation Research Procedia, 3, 810–817.

    Article  Google Scholar 

  • Hastie, T., Tibshirani, R., Friedman, J. (2009). The elements of statistical learning: data mining, inference, and prediction, 2nd edn. (Springer Series in Statistics).

  • LeSage, J.P., & Pace, R.K. (2009). Introduction to spatial econometrics, CRC Press.

  • McMillen, D.P. (1996). One hundred fifty years of land values in Chicago: A nonparametric approach. Journal of Urban Economics, 40, 100–124.

    Article  Google Scholar 

  • McMillen, D.P., & McDonald, J.F. (1997). A nonparametric analysis of employment density in a polycentric City. Journal of Regional Science, 37, 591–612.

    Article  Google Scholar 

  • Nguyen, N., & Cripps, A. (2001). Predicting housing value: A comparison of multiple regression analysis and artificial neural network. Journal of Real Estate Research, 22(3), 313–336.

    Google Scholar 

  • Pace, R.K., & Gilley, O.W. (1998). Generalizing the OLS and grid estimators. Real Estate Economics, 26(2), 331–347.

    Article  Google Scholar 

  • Pace, R.K., & LeSage, J.P. (2004). Spatial autoregressive local estimation, Spatial econometrics and spatial statistics, (pp. 31–51). New York: Palgrave MacMillan.

    Google Scholar 

  • Pace, R.K., & LeSage, J.P. (2008). A spatial Hausman test. Economics Letters, 101(3), 282–284.

    Article  Google Scholar 

  • Pace, R.K., Barry, R., Gilley, O.W., Sirmans, C.F. (2000). A method for spatialtemporal forecasting with an application to real estate prices. International Journal of Forecasting, 16(2), 229–246.

    Article  Google Scholar 

  • Pace, R.K., LeSage, J.P., Zhu, S. (2012). Spatial dependence in regressors and its effect on performance of likelihood-based and instrumental variable estimators, Advances in Econometrics, 30th Anniversary Edition. Emerald, 257–295.

  • Quang Do, A., & Grudnitski, G. (1993). A neural network analysis of the effect of age on housing values. Journal of Real Estate Research, 8(2), 253–264.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Kelley Pace.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pace, R.K., Hayunga, D. Examining the Information Content of Residuals from Hedonic and Spatial Models Using Trees and Forests. J Real Estate Finan Econ 60, 170–180 (2020). https://doi.org/10.1007/s11146-019-09724-w

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11146-019-09724-w

Keywords

Navigation