Modeling and elucidation of housing price

  • Fei Tan
  • Chaoran Cheng
  • Zhi WeiEmail author


It is widely acknowledged that the value of a house is the mixture of a large number of characteristics. House price prediction thus presents a unique set of challenges in practice. While a large body of works are dedicated to this task, their performance and applications have been limited by the shortage of long time span of transaction data, the absence of real-world settings and the insufficiency of housing features. To this end, a time-aware latent hierarchical model is developed to capture underlying spatiotemporal interactions behind the evolution of house prices. The hierarchical perspective obviates the need for historical transaction data of exactly same houses when temporal effects are considered. The proposed framework is examined on a large-scale dataset of the property transaction in Beijing. The whole experimental procedure strictly conforms to the real-world scenario. The empirical evaluation results demonstrate the outperformance of our approach over alternative competitive methods. We also group housing features into both external and internal clusters. The further experiment unveils that external component shapes house prices much more heavily than the internal one does. More interestingly, the inference of latent neighborhood value in our model is empirically shown to be able to lessen the dependence on the critical external cluster of features in house price prediction.


House prices Spatiotemporal effects Internal component External component Neighborhood value 



  1. Ahearne AG, Ammer J, Doyle BM, Kole LS, Martin RF (2005) House prices and monetary policy: a cross-country study. In: International finance discussion papers 841Google Scholar
  2. Bailey MJ, Muth RF, Nourse HO (1963) A regression method for real estate price index construction. J Am Stat Assoc 58(304):933–942CrossRefGoogle Scholar
  3. Baral R, Li T (2017) Exploiting the roles of aspects in personalized poi recommender systems. Data Min Knowl Discov 32:320–343MathSciNetCrossRefGoogle Scholar
  4. Besag J (1986) On the statistical analysis of dirty pictures. J R Stat Soc Ser B 48(3):259–302MathSciNetzbMATHGoogle Scholar
  5. Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, CambridgeCrossRefzbMATHGoogle Scholar
  6. Can A (1990) The measurement of neighborhood dynamics in urban house prices. Econ Geogr 66(3):254–272CrossRefGoogle Scholar
  7. Case B, Pollakowski HO, Wachter SM (1991) On choosing among house price index methodologies. Real Estate Econ 19(3):286–307CrossRefGoogle Scholar
  8. Case B, Clapp J, Dubin R, Rodriguez M (2004) Modeling spatial and temporal house price patterns: a comparison of four models. J Real Estate Finance Econ 29(2):167–191CrossRefGoogle Scholar
  9. Case KE, Shiller RJ (1989) The efficiency of the market for single-family homes. Am Econ Rev 79(1):125–137Google Scholar
  10. Case KE, Shiller RJ et al (1987) Prices of single-family homes since 1970: new indexes for four cities. N Engl Econ Rev (Sept/Oct):45–56Google Scholar
  11. Chopra S, Thampy T, Leahy J, Caplin A, LeCun Y (2007) Discovering the hidden structure of house prices with a non-parametric latent manifold model. In: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 173–182Google Scholar
  12. De Bruyne K, Van Hove J (2013) Explaining the spatial variation in housing prices: an economic geography approach. Appl Econ 45(13):1673–1689CrossRefGoogle Scholar
  13. Deng D, Shahabi C, Demiryurek U, Zhu L, Yu R, Liu Y (2016) Latent space model for road networks to predict time-varying traffic. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1525–1534Google Scholar
  14. Fu Y, Ge Y, Zheng Y, Yao Z, Liu Y, Xiong H, Yuan J (2014a) Sparse real estate ranking with online user reviews and offline moving behaviors. In: Data mining (ICDM), 2014 IEEE international conference on. IEEE, pp 120–129Google Scholar
  15. Fu Y, Xiong H, Ge Y, Yao Z, Zheng Y, Zhou ZH (2014b) Exploiting geographic dependencies for real estate appraisal: a mutual perspective of ranking and clustering. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1047–1056Google Scholar
  16. Fu Y, Liu G, Papadimitriou S, Xiong H, Ge Y, Zhu H, Zhu C (2015) Real estate ranking via mixed land-use latent models. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 299–308Google Scholar
  17. Fu Y, Xiong H, Ge Y, Zheng Y, Yao Z, Zhou ZH (2016) Modeling of geographic dependencies for real estate ranking. ACM Trans Knowl Discov Data 11(1):11CrossRefGoogle Scholar
  18. Gelfand AE, Ecker MD, Knight JR, Sirmans C (2004) The dynamics of location in home price. J Real Estate Finance Econ 29(2):149–166CrossRefGoogle Scholar
  19. Goetzmann WN, Peng L (2002) The bias of the RSR estimator and the accuracy of some alternatives. Real Estate Econ 30(1):13–39CrossRefGoogle Scholar
  20. Goodman AC (1978) Hedonic prices, price indices and housing markets. J Urban Econ 5(4):471–484CrossRefGoogle Scholar
  21. Gu Z, Gu L, Eils R, Schlesner M, Brors B (2014) Circlize implements and enhances circular visualization in R. Bioinformatics 30(19):2811–2812CrossRefGoogle Scholar
  22. Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688CrossRefGoogle Scholar
  23. Jiang S, Ferreira J, González MC (2012) Clustering daily patterns of human activities in the city. Data Min Knowl Discov 25:478–510MathSciNetCrossRefzbMATHGoogle Scholar
  24. Liu B, Mavrin B, Niu D, Kong L (2016) House price modeling over heterogeneous regions with hierarchical spatial functional analysis. In: Data mining (ICDM), 2016 IEEE 16th international conference on. IEEE, pp 1047–1052Google Scholar
  25. Lü L, Zhou T (2011) Link prediction in complex networks: a survey. Physica A 390(6):1150–1170CrossRefGoogle Scholar
  26. Meese R, Wallace N (1991) Nonparametric estimation of dynamic hedonic price models and the construction of residential housing price indices. Real Estate Econ 19(3):308–332CrossRefGoogle Scholar
  27. Nagaraja CH, Brown LD, Zhao LH (2011) An autoregressive approach to house price modeling. Ann Appl Stat 5(1):124–149MathSciNetCrossRefzbMATHGoogle Scholar
  28. Pace RK, Barry R, Clapp JM, Rodriquez M (1998) Spatiotemporal autoregressive models of neighborhood effects. J Real Estate Finance Econ 17(1):15–33CrossRefGoogle Scholar
  29. Pace RK, Barry R, Gilley OW, Sirmans C (2000) A method for spatial-temporal forecasting with an application to real estate prices. Int J Forecast 16(2):229–246CrossRefGoogle Scholar
  30. Peterson S, Flanagan A (2009) Neural network hedonic pricing models in mass real estate appraisal. J Real Estate Res 31(2):147–164Google Scholar
  31. Sangalli LM, Ramsay JO, Ramsay TO (2013) Spatial spline regression models. J R Stat Soc Ser B (Stat Methodol) 75(4):681–703MathSciNetCrossRefGoogle Scholar
  32. Shiller RJ (1991) Arithmetic repeat sales price estimators. J Hous Econ 1(1):110–126CrossRefGoogle Scholar
  33. Smith TE, Wu P (2009) A spatio-temporal model of housing prices based on individual sales transactions over time. J Geogr Syst 11(4):333CrossRefGoogle Scholar
  34. Tan F, Xia Y, Zhu B (2014) Link prediction in complex networks: a mutual information perspective. PLOS ONE 9(9):e107,056CrossRefGoogle Scholar
  35. Tan F, Cheng C, Wei Z (2016) Modeling real estate for school district identification. In: Data mining (ICDM), 2016 IEEE 16th international conference on. IEEE, pp 1227–1232Google Scholar
  36. Tan F, Cheng C, Wei Z (2017) Time-aware latent hierarchical model for predicting house prices. In: Data mining (ICDM), 2017 IEEE 16th international conference on. IEEE, pp 1111–1116Google Scholar
  37. Tan F, Du K, Wei Z, Liu H, Qin C, Zhu R (2018) Modeling item-specific effects for video click. In: Proceedings of the 2018 SIAM international conference on data mining. SIAM, pp 639–647Google Scholar
  38. Taylor LO (2003) The hedonic method. In: A primer on nonmarket valuation, pp 331–393Google Scholar
  39. Yao Z, Fu Y, Liu B, Xiong H (2016) The impact of community safety on house ranking. In: Proceedings of the 2016 SIAM international conference on data mining. SIAM, pp 459–467Google Scholar
  40. Zhou J, Wang F, Hu J, Ye J (2014) From micro to macro: data driven phenotyping by densification of longitudinal electronic medical records. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 135–144Google Scholar
  41. Zhu H, Xiong H, Tang F, Liu Q, Ge Y, Chen E, Fu Y (2016) Days on market: Measuring liquidity in real estate markets. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 393–402Google Scholar

Copyright information

© The Author(s), under exclusive licence to Springer Science+Business Media LLC, part of Springer Nature 2019

Authors and Affiliations

  1. 1.Department of Computer ScienceNew Jersey Institute of TechnologyNewarkUSA

Personalised recommendations