Abstract
Machine learning algorithms such as neural nets, support vector machines, and tree-based techniques (classification and regression trees) have shown great success in dealing with a number of complex problems (Hastie et al. 2009). However, real estate data exhibit both temporal dependence and high levels of spatial dependence (Pace et al., International Journal of Forecasting16(2), 229–246, 2000; LeSage and Pace 2009) that may make it harder to use with off-the-shelf machine learning procedures. We examine tree-based techniques (CART, boosting, and bagging) and compare these to spatiotemporal methods. We find that bagging works well and can give lower ex-sample residuals than global spatiotemporal methods, but do not perform better than local spatiotemporal methods.
Similar content being viewed by others
Notes
These include Grand Prairie, Irving, Plano, Farmers branch, Garland, Mesquite, Highland Park, University Park, Euless, and Grapevine.
Pace et al. (2012) show that dependence in the explanatory variables can make an impact in several scenarios and can account for the aberrant estimates from IV estimators that sometimes occurs in practice.
The actual observations were fewer than these amounts because of filters applied such as requiring 500 or more square feet as described in the text associated with Table 1.
References
Bogin, A.N., & Shui, J. (2018). Appraisal accuracy, automated valuation models, and credit modeling in rural areas, Working Paper 18-03, Federal Housing Finance Agency.
Borst, R.A. (1992). Artificial neural networks: The next modelling/calibration technology for the assessment community. Artificial Neural Networks, 64–94.
Borst, R.A., & McCluskey, W.J. (1997). An evaluation of MRA, comparable sales analysis and ANNs for the mass appraisal of residential properties in Northern Ireland. Assessment Journal, 4(1), 47–55.
Brunsdon, C., Fotheringham, A.S., Charlton, M.E. (1996). Geographically weighted regression: A method for exploring spatial non-stationarity. Geographical Analysis, 28, 281–298.
Chiarazzo, V., Caggiani, L., Marinelli, M., Ottomanelli, M. (2014). A neural network based model for real estate price estimation considering environmental quality of property location. Transportation Research Procedia, 3, 810–817.
Hastie, T., Tibshirani, R., Friedman, J. (2009). The elements of statistical learning: data mining, inference, and prediction, 2nd edn. (Springer Series in Statistics).
LeSage, J.P., & Pace, R.K. (2009). Introduction to spatial econometrics, CRC Press.
McMillen, D.P. (1996). One hundred fifty years of land values in Chicago: A nonparametric approach. Journal of Urban Economics, 40, 100–124.
McMillen, D.P., & McDonald, J.F. (1997). A nonparametric analysis of employment density in a polycentric City. Journal of Regional Science, 37, 591–612.
Nguyen, N., & Cripps, A. (2001). Predicting housing value: A comparison of multiple regression analysis and artificial neural network. Journal of Real Estate Research, 22(3), 313–336.
Pace, R.K., & Gilley, O.W. (1998). Generalizing the OLS and grid estimators. Real Estate Economics, 26(2), 331–347.
Pace, R.K., & LeSage, J.P. (2004). Spatial autoregressive local estimation, Spatial econometrics and spatial statistics, (pp. 31–51). New York: Palgrave MacMillan.
Pace, R.K., & LeSage, J.P. (2008). A spatial Hausman test. Economics Letters, 101(3), 282–284.
Pace, R.K., Barry, R., Gilley, O.W., Sirmans, C.F. (2000). A method for spatialtemporal forecasting with an application to real estate prices. International Journal of Forecasting, 16(2), 229–246.
Pace, R.K., LeSage, J.P., Zhu, S. (2012). Spatial dependence in regressors and its effect on performance of likelihood-based and instrumental variable estimators, Advances in Econometrics, 30th Anniversary Edition. Emerald, 257–295.
Quang Do, A., & Grudnitski, G. (1993). A neural network analysis of the effect of age on housing values. Journal of Real Estate Research, 8(2), 253–264.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Pace, R.K., Hayunga, D. Examining the Information Content of Residuals from Hedonic and Spatial Models Using Trees and Forests. J Real Estate Finan Econ 60, 170–180 (2020). https://doi.org/10.1007/s11146-019-09724-w
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11146-019-09724-w