Skip to main content
Log in

Combining Property Price Predictions from Repeat Sales and Spatially Enhanced Hedonic Regressions

  • Published:
The Journal of Real Estate Finance and Economics Aims and scope Submit manuscript

Abstract

Hedonic regression and repeat sales are commonly used methods in real estate analysis. While the merits of combining these models when constructing house price indices are well documented, research on the utility of adopting the same approach for residential property valuation has not been conducted to date. Specifically, house value estimates were obtained by combining predictions from repeat sales and various hedonic regression specifications, which were enhanced to account for spatial effects. Three of these enhancements—regression kriging, mixed regressive-spatial autoregressive, and geographically weighted regression—are widely utilized spatial econometric models. However, a fourth augmentation, which addresses systematic residual patterns in regressions with district indicator variables and the presence of outliers in housing data, was also proposed. The resulting models were applied to a dataset containing 16,417 real estate transactions in Oslo, Norway, revealing that when the repeat sales approach is included, it reduces the median absolute percentage error of solely hedonic models by 6.8–9.5%, where greater improvements are associated with less accurate spatial enhancements. These improvements can be attributed to the inclusion of both spatial and non-spatial information inherent in previous sales prices. While the former has limited utility for well-specified spatial models, the non-spatial information that is implicit in previous sales prices likely captures otherwise difficult to observe phenomena, potentially making its contribution highly valuable in automated valuation models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. Extensive literature review has failed to uncover any publicly available research on this topic. However, some companies advertise automated valuation based on both models, e.g., Home Value Explorer® by FreddieMac (2017).

  2. FINN covers approximately 70% of the Norwegian housing market (Norge et al. 2017). All properties in the dataset employed in the current investigation were announced on the site.

  3. In Norway, cooperatives and apartment buildings can take on common debt, for example, to renovate the building. Especially for cooperatives, the common debt can be high compared with the transaction price of an apartment. The total price of a dwelling in Norway is the transaction price, plus the dwellings share of the total common debt.

  4. The term simple kriging is used when the mean of the dependent variable is assumed to be known a priori (Cressie 1990).

  5. Either administrative or generated by k-means, depending on the variable type required in the regression.

  6. By testing different weighting and combinations we found no single optimal solution for multiple performance metrics, resulting in our choice of a “trail-and-error” based weight of 60% for the regression estimate that resulted on both high prediction and low volatility across multiple runs.

  7. Generally, model performance is measured by median absolute percentage error (Q0.5).

  8. Row 13 in Figure 4 shows unsatisfactory performance by VRT where district variables are omitted.

References

  • Amundsen, B. (2015). Rike og fattige flytter fra hverandre i Oslo [Rich and poor moving apart from each other in Oslo]. Forskning.no, Available at: https://forskning.no/samfunnsgeografi/2015/06/rike-og-fattige-flytter-fra-hverandre-i-oslo (accessed: 11 December 2017).

  • Anon. (2013). OECD, Eurostat, International Labour Organization, International Monetary Fund, The World Bank, and United Nations Economic Commission for Europe. Repeat sales methods. Handbook on Residential Property Prices Indices (RPPIs), OECD publishing, Paris, 67–72.

  • Anselin, L. (2010). Thirty years of spatial econometrics. Papers in Regional Science, 89(1), 3–25.

    Google Scholar 

  • Bailey, M. J., Muth, R. F., & Nourse, H. O. (1963). A regression method for real estate price index construction. Journal of the American Statistical Association, 58(304), 933–942.

    Google Scholar 

  • Basu, S., & Thibodeau, T. G. (1998). Analysis of spatial autocorrelation in house prices. The Journal of Real Estate Finance and Economics, 17(1), 61–85.

    Google Scholar 

  • Bates, J. M., & Granger, C. W. (1969). The combination of forecasts. Journal of the Operational Research Society, 20(4), 451–468.

    Google Scholar 

  • Bohling, G. (2005). Kriging. Data Analysis in Engineering and Natural Science, Kansas Geological Survey. Available at: http://people.ku.edu/~gbohling/cpe940/Kriging.pdf (accessed: 8 June 2018).

  • Bowen, W. M., Mikelbank, B. A., & Prestegaard, D. M. (2001). Theoretical and empirical considerations regarding space in hedonic housing price model applications. Growth and Change, 32(4), 466–490.

    Google Scholar 

  • Can, A. (1992). Specification and estimation of hedonic housing price models. Regional Science and Urban Economics, 22(3), 453–474.

    Google Scholar 

  • Can, A., & Megbolugbe, I. (1997). Spatial dependence and house price index construction. The Journal of Real Estate Finance and Economics, 14(1–2), 203–222.

    Google Scholar 

  • Case, B., & Quigley, J. M. (1991). The dynamics of real estate prices. The Review of Economics and Statistics, 73(1), 50–58.

    Google Scholar 

  • Case, K. E., & Shiller, R. J. (1987). Prices of single-family homes since 1970: New indexes for four cities. In NBER working paper series 2393. National Bureau of Economic: Research. https://doi.org/10.3386/w2393.

    Chapter  Google Scholar 

  • Case, B., Pollakowski, H. O., & Wachter, S. M. (1991). On choosing among house price index methodologies. Real Estate Economics, 19(3), 286–307.

    Google Scholar 

  • Case, B., Clapp, J., Dubin, R., & Rodriguez, M. (2004). Modeling spatial and temporal house price patterns: A comparison of four models. The Journal of Real Estate Finance and Economics, 29(2), 167–191.

    Google Scholar 

  • Chan, Y. L., Stock, J. H., & Watson, M. W. (1999). A dynamic factor model framework for forecast combination. Spanish Economic Review, 1(2), 91–121.

    Google Scholar 

  • Clapp, J. M., Giaccotto, C., & Tirtiroglu, D. (1991). Housing price indices based on all transactions compared to repeat subsamples. Real Estate Economics, 19(3), 270–285.

    Google Scholar 

  • Clemen, R. T. (1989). Combining forecasts: A review and annotated bibliography. International Journal of Forecasting, 5(4), 559–583.

    Google Scholar 

  • Cleveland, W. S. (1979). Robust locally weighted regression and smoothing scatterplots. Journal of the American Statistical Association, 74(368), 829–836.

    Google Scholar 

  • Cliff, A. D., & Ord, K. (1970). Spatial autocorrelation: A review of existing and new measures with applications. Economic Geography, 46, 269–292.

    Google Scholar 

  • Corcoran, C., & Liu, F. (2014). Accuracy of Zillow’s home value estimates. Real Estate Issues, 39(1), 45–49.

    Google Scholar 

  • Cressie, N. (1990). The origins of kriging. Mathematical Geology, 22(3), 239–252.

    Google Scholar 

  • Dubin, R. A. (1988). Estimation of regression coefficients in the presence of spatially autocorrelated error terms. The Review of Economics and Statistics, 70(3), 466–474.

    Google Scholar 

  • Dubin, R. A. (1998). Spatial autocorrelation: a primer. Journal of Housing Economics, 7(4), 304–327.

    Google Scholar 

  • Eurostat. (2016). Distribution of population by tenure status, type of household and income group − EU-SILC survey. Available at: http://appsso.eurostat.ec.europa.eu/nui/show.do?datasetÆilc_lvho02&langÆen (accessed: 20 May 2018).

  • Farber, S., & Yeates, M. (2006). A comparison of localized regression models in a hedonic house price context. Canadian Journal of Regional Science, 29(3), 405–420.

    Google Scholar 

  • Fik, T. J., Ling, D. C., & Mulligan, G. F. (2003). Modeling spatial variation in housing prices: A variable interaction approach. Real Estate Economics, 31(4), 623–646.

    Google Scholar 

  • Fotheringham, A. S. (2009). Geographically weighted regression. In S. Fotheringham & P. Rogerson (Eds.), The SAGE handbook of spatial analysis, 243–254. London: SAGE Publications Ltd.

    Google Scholar 

  • Fotheringham, A. S., Brunsdon, C., & Charlton, M. (2002). Geographically weighted regression: The analysis of spatially varying relationships. New York: Wiley.

    Google Scholar 

  • FreddieMac. (2017). Home Value Explorer. Available at: http://www.freddiemac.com/hve/hve.html (accessed: 5 December 2017).

  • Geary, R. C. (1954). The contiguity ratio and statistical mapping. The Incorporated Statistician, 5(3), 115–146.

    Google Scholar 

  • Gelfand, A. E., Ghosh, S. K., Knight, J. R., & Sirmans, C. F. (1998). Spatio-temporal modeling of residential sales data. Journal of Business & Economic Statistics, 16(3), 312–321.

    Google Scholar 

  • Goovaerts, P. (1999). Using elevation to aid the geostatistical mapping of rainfall erosivity. Catena, 34(3–4), 227–242.

    Google Scholar 

  • de Haan, J. & E. Diewert (2013), "Hedonic Regression Methods”. Handbook on Residential Property Price Indices, OECD Publishing, Paris, https://doi.org/10.1787/9789264197183-7-en.

  • Harris, P., Fotheringham, A., Crespo, R., & Charlton, M. (2010). The use of geographically weighted regression for spatial prediction: An evaluation of models using simulated data sets. Mathematical Geosciences, 42(6), 657–680.

    Google Scholar 

  • Harris, P., Brunsdon, C., & Fotheringham, A. S. (2011). Links, comparisons and extensions of the geographically weighted regression model when used as a spatial predictor. Stochastic Environmental Research and Risk Assessment, 25(2), 123–138.

    Google Scholar 

  • Helbich, M., Brunauer, W., Hagenauer, J., & Leitner, M. (2013). Data-driven regionalization of housing markets. Annals of the Association of American Geographers, 103(4), 871–889.

    Google Scholar 

  • Hengl, T., Heuvelink, G. B. M., & Stein, A. (2003). Comparison of kriging with external drift and regression kriging. Technical note, ITC. Available at: https://webapps.itc.utwente.nl/librarywww/papers_2003/misca/hengl_comparison.pdf (accessed: 11 May 2018).

  • Hengl, T., Heuvelink, G. B. M., & Rossiter, D. G. (2007). About regression-kriging: From equations to case studies. Computers & Geosciences, 33(10), 1301–1315.

    Google Scholar 

  • Humberset, K. (2018). Her må du punge ut 85.000 kroner for én kvadratmeter [Here you have to pay 85,000 NOK for one square meter]. aftenposten.no. Available at: https://www.aftenposten.no/bolig/Her-ma-du-punge-ut-85000-kroner-for-n-kvadratmeter-10982b.html (accessed: 4 June 2018).

  • Kitanidis, P. K. (1993). Generalized covariance functions in estimation. Mathematical Geology, 25(5), 525–540.

    Google Scholar 

  • Koenker, R., & Bassett, G., Jr. (1978). Regression quantiles. Econometrica: Journal of the Econometric Society, 46(1), 33–50.

    Google Scholar 

  • Krause, A. L., & Bitter, C. (2012). Spatial econometrics, land values and sustainability: Trends in real estate valuation research. Cities, 29, S19–S25.

    Google Scholar 

  • LeSage, J. P., & Pace, R. K. (2014). The biggest myth in spatial econometrics. Econometrics, 2(4), 217–249.

    Google Scholar 

  • Levin, J. (2001). Information and the market for lemons. RAND Journal of Economics, 32(4), 657–666.

    Google Scholar 

  • Monsrud, I. J. & Takle, M. (2018). Price index for existing dwellings. Statistics Norway. Available at: https://www.ssb.no/en/priser-og-prisindekser/statistikker/bpi (accessed: 7 June 2018).

  • Moran, P. A. (1950). A test for the serial independence of residuals. Biometrika, 37(1/2), 178–181.

    Google Scholar 

  • Eiendom Norge, Eiendomsverdi & FINN.no. (2017). Eiendom Norges boligprisstatistikk [Real Estate Norway housing price statistics]. Available at: http://eiendomnorge.no/wpcontent/uploads/2017/11/Boligstatistikk_oktober_01.pdf (accessed: 4 December 2017).

  • Odeh, I. O. A., McBratney, A. B., & Chittleborough, D. J. (1995). Further results on prediction of soil properties from terrain attributes: Heterotopic kriging and regression kriging. Geoderma, 67(3–4), 215–226.

    Google Scholar 

  • Olaussen, J. O., Oust, A., & Solstad, J. T. (2017). Energy performance certificates–informing the informed or the indifferent? Energy Policy, 111, 246–254.

    Google Scholar 

  • Oslo Kommune. (2018). Finn bydelen din her [Find your city district here]. Available at: https://www.oslo.kommune.no/politikk-og-administrasjon/bydeler/bydelsvelger/ (accessed: 13 May 2018).

  • Pace, R. K., Barry, R., Gilley, O. W., & Sirmans, C. F. (2000). A method for spatial–temporal forecasting with an application to real estate prices. International Journal of Forecasting, 16(2), 229–246.

    Google Scholar 

  • Páez, A., Long, F., & Farber, S. (2008). Moving window approaches for hedonic price estimation: An empirical comparison of modelling techniques. Urban Studies, 45(8), 1565–1581.

    Google Scholar 

  • Park, B., & Bae, J. K. (2015). Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data. Expert Systems with Applications, 42(6), 2928–2934.

    Google Scholar 

  • Reichert, A. K. (1990). The impact of interest rates, income, and employment upon regional housing prices. The Journal of Real Estate Finance and Economics, 3(4), 373–391.

    Google Scholar 

  • Rosen, S. (1974). Hedonic prices and implicit markets: Product differentiation in pure competition. Journal of Political Economy, 82(1), 34–55.

    Google Scholar 

  • Sinnott, R. W. (1984). Virtues of the haversine. Sky and Telescope, 68(2), 159.

    Google Scholar 

  • Trawiński, B., Lasota, T., Kempa, O., Telec, Z., & Kutrzyński, M. (2017). Comparison of ensemble learning models with expert algorithms designed for a property valuation system. In N. Nguyen, G. Papadopoulos, P. Jędrzejowicz, B. Trawiński, & G. Vossen (Eds.), Computational collective intelligence, ICCCI 2017, lecture notes in computer science, vol 10448. Cham: Springer.

    Google Scholar 

  • Wheeler, D. C., & Calder, C. A. (2007). An assessment of coefficient accuracy in linear regression models with spatially varying coefficients. Journal of Geographical Systems, 9(2), 145–166.

    Google Scholar 

  • Yao, J., & Fotheringham, A. S. (2016). Local spatiotemporal modeling of house prices: A mixed model approach. The Professional Geographer, 68(2), 189–201.

    Google Scholar 

  • Yoo, S.-H. (2001). A robust estimation of hedonic price models: Least absolute deviations estimation. Applied Economics Letters, 8(1), 55–58.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Are Oust.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

Table 8 K-means results with varying k, compared to administrative districts
Table 9 Independent repeat sales results
Fig. 6
figure 6

Out-of-sample residuals from the hedonic regression without spatial enhancements or district indicators (referred to in Section 5 as the benchmark model) depicted on the map of Oslo. Lines represent administrative district boundaries, while marker color indicates residual value for each dwelling. Observations: 3284

Fig. 7
figure 7

Out-of-sample residuals from the hedonic regression with administrative district indicators depicted on the map of Oslo. Lines represent administrative district boundaries, whereas marker color indicates residual value for each dwelling. Observations: 3284

Fig. 8
figure 8

Out-of-sample residuals from the most accurate model that combines geographically weighted regression and repeat sales depicted on the map of Oslo. Lines represent administrative district boundaries, whereas marker color denotes residual value for each dwelling. Observations: 3284

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Oust, A., Hansen, S.N. & Pettrem, T.R. Combining Property Price Predictions from Repeat Sales and Spatially Enhanced Hedonic Regressions. J Real Estate Finan Econ 61, 183–207 (2020). https://doi.org/10.1007/s11146-019-09723-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11146-019-09723-x

Keywords

Navigation