Abstract
This paper employs classical bivariate, slab-and-spike variable selection, Bayesian semi-parametric shrinkage, and factor-augmented predictive regression models to forecast US real private residential fixed investment over an out-of-sample period from 1983Q1 to 2005Q4, based on in-sample estimates for 1963Q1–1982Q4. Both large-scale (188 macroeconomic series) and small-scale (20 macroeconomic series) slab-and-spike variable selection, and Bayesian semi-parametric shrinkage, and factor-augmented predictive regressions, as well as 20 bivariate regression models, capture the influence of fundamentals in forecasting residential investment. We evaluate the ex post out-of-sample forecast performance of the 26 models using the relative average mean square error for one-, two-, four-, and eight-quarter-ahead forecasts and test their significance based on the McCracken (2004, J Econom 140:719–752, 2007) mean-square-error F statistic. We find that, on average, the slab-and-spike variable selection and Bayesian semi-parametric shrinkage models with 188 variables provides the best forecasts among all the models. Finally, we use these two models to predict the relevant turning points of the residential investment, via an ex ante forecast exercise from 2006Q1 to 2012Q4. The 188 variable slab-and-spike variable selection and Bayesian semi-parametric shrinkage models perform quite similarly in their accuracy of forecasting the turning points. Our results suggest that economy-wide factors, in addition to specific housing market variables, prove important when forecasting in the real estate market.
This is a preview of subscription content, access via your institution.




Notes
The list of references to document the choice of these variables is available from the authors. The 20 variables include interest rates (3-month Treasury bill rate, 3TB), real gross domestic product (RGDP), the consumer price index (CPI), the unemployment rate (UNRATE), the labor force participation rate (LFPR), the 30-year conventional mortgage interest rate (MORTG), the business confidence index (BCON), the real house price index (RHP), the money supply (M1), real private consumption expenditure (RPCON), real government consumption expenditure (RGCON), the real change in private inventories (RCPINV), housing starts (HOUST), real non-residential fixed investment (RNRFINV), the Standard & Poor’s stock price index (S&P), retail sales (RSALES), new private housing units authorized by building permit (PERMIT), number of new houses sold (HSOLD), and the months’ supply of housing ratio (HSUPPLY).
The gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, usually parameterized with (1) shape and scale parameters, (2) shape and inverse scale parameters, or (3) shape and mean parameters (SAS 2012). The inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is distributed as the reciprocal of a variable distributed according to the gamma distribution (SAS 2012). The beta distribution is a general statistical distribution that relates to the gamma distribution and contains two free parameters, often used as a prior distribution for binomial proportions in Bayesian analysis (Forbes et al. 2010).
The online Technical Appendix of Korobilis (2013b) details the MCMC method.
The Appendix contains a full description of all variables and the relevant stationarity transformations used.
Based on the suggestions of an anonymous referee, we also considered the performance of the double-differencing device (DDD) model proposed by Hendry (2006), as a possible benchmark. Since the AR(1) model consistently outperformed the DDD model at all horizons over the out-of-sample; however, we still chose the former as our benchmark. Complete details comparing the forecasting results of the AR(1) and DDD models are available from the authors.
The MSE-F statistic uses the loss differential as follows: \(MSE-F=(T-R-h+1)\left( {\bar{{d}}/MSE_1 } \right) \), where T equals the number of observations in the total sample, R equals number of observations used to estimate the model from which we calculate the first forecast (i.e., the in-sample portion of T), h equals the forecast horizon, \({\bar{d}}=M{\hat{S}}E_0 -M{\hat{S}}E_1\), \(M{\hat{S}}E_i =(T-R-h+1)^{-1}\sum _{t=R}^{T-h} {(u_{i,t+1})^{2}} \) with \(i=1, 0,\, M{\hat{S}}E_1\) corresponds to the MSE of the unrestricted model (i.e., the model with the relevant macroeconomic predictor variables), and \(M{\hat{S}}E_0\) corresponds to the MSE of the restricted model (i.e., the AR(1)-benchmark model). A significant MSE-F statistic indicates that the unrestricted model forecasts are statistically more accurate than those of the restricted model. Note, however, that the MSE-F statistic exhibits a non-standard and non-pivotal limiting distribution in the case of nested models and \(h>1\). Hence, we base our inferences on the bootstrap procedure described in detail in (Rapach et al. 2005; Rapach and Wohar 2006). These two papers provide further details.
We recursively update the ex post forecasts in-sample in the forecasting equation to generate the multi-step-ahead forecasts, whereas we produce the ex ante multi-step-ahead forecasts from a specific point in time (generally, from the end-point of data available on the predictors, which in our case is 2006:Q1-2012:Q4) without updating the parameter estimates. The ex ante forecasts give an objective statistical method (approach) to choose the best performing models, which, in turn, we use to predict the turning points.
As indicated earlier, the structure of the macroeconomic data commonly used by macroeconomists frequently involves highly correlated variables, which the SSVS model does not account for, but is incorporated in the BSS modeling approach. While, on average, we observe the gains from using the large-scale BSS approach over the corresponding SSVS approach, in our case, these two models produce statistically similar accuracy in forecasting the real private fixed residential investment.
An earlier version of the paper considered an extended out-of-sample horizon covering 1983:Q1-2011:Q2. Based on this out-of-sample period, the SSVS-Large model performed the best, on average, followed by the bivariate predictive regression model consisting of H4SALE. When we used these two models to compare the turning points over the out-of-sample period, we found that the forecast from the H4SALE model was more volatile relative to the SSVS-Large model, and in general, the SSVS-Large model tracked the turning points reasonably well, except during the recent crisis.
References
Alessi L, Barigozzi M, Capasso M (2010) Improved penalization for determining the number of factors in approximate factor models. Stat Probab Lett 80:1806–1813
Baghestani H (2011) Federal reserve and private forecasts of growth in investment. J Econ Bus 63:290–305
Bai J, Ng S (2002) Determining the number of factors in approximate factor models. Econometrica 70:191–221
Berger-Thomas L, Ellis L (2004) Housing construction cycles and interest rates. Research Discussion Paper 2004–08, Reserve Bank of Australia. http://www.rba.gov.au/publications/rdp/2004/2004-08.html. Accessed 25 Mar 2015
Bernanke BS (2009) On the outlook for the economy and policy. Speech at the Economic Club of New York, New York, New York. http://www.federalreserve.gov/newsevents/speech/bernanke20091116a.htm. Accessed 20 Mar 2013
Bernanke BS (2012) The economic recovery and economic policy. Speech at the New York Economic Club, New York, New York. http://www.federalreserve.gov/newsevents/speech/bernanke20121120a.htm. Accessed 5 Apr 2013
Brayton F, Tinsley P (1996) A guide to FRB/US: a macroeconomic model of the United States. Federal Reserve Board Finance and Economics Discussion Series. http://www.federalreserve.gov/pubs/feds/1996/199642/199642abs.html Accessed 20 Mar 2013
Browne LE (2000) National and regional housing patterns. N Engl Econ Rev 4:32–57
Choy LHT, Ho WKO, Mak SWK (2011) Region-specific estimates of the determinants of residential investment. J Urban Plan D-ASCE 137:1–6
Clements M, Hendry D (1998) Forecasting economic time series. Cambridge University Press, New York
Cremers M (2002) Stock return predictability: a Bayesian model selection perspective. Rev Financ Stud 15:1223–1249
Daly JS (2008) Overcoming barriers to institutional investment in residential property. Greater London Authority, Economics Working Paper, No. 29. http://london.gov.uk/mayor/economic_unit/docs/wp_29.pdf. Accessed 20 Apr 2013
Demers F (2005) Modelling and forecasting housing investment: the case of Canada. Bank of Canada, Ottawa, Working Paper 2005–41, December 2005. http://www.bankofcanada.ca/wp-content/uploads/2010/02/wp05-41.pdf. Accessed 25 Apr 2013
Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econ Stat 13:253–263
Dua P, Nishita R, Sahoo S (2008) Forecasting interest rates in India. Margin J Appl Econ Res 2:1–41
Dunson DB, Herring AH, Engel SM (2008) Bayesian selection and clustering of polymorphisms in functionally related genes. J Am Stat Assoc 103:534–546
Dynan K, Elmendorf D, Sichel D (2006) Can financial innovation help to explain the reduced volatility of economic activity? J Monet Econ 53:123–150
Edge RM (2000) The effect of monetary policy on residential and structures investment under differential project planning and completion times. International Finance Discussion Paper No. 671, Board of Governors of the Federal Reserve System, Washington, DC, US. http://www.federalreserve.gov/pubs/ifdp/2000/671/ifdp671.pdf. Accessed 28 Mar 2013
Egebo T, Richardson P, Lienert I (1990) A model of housing investment for the major OECD economies. OECD Econ Stud 14:151–188
Ferguson TS (1973) A Bayesian analysis of some nonparametric problems. Ann Stat 1(2):209–230
Fisher JDM., Gervais M (2007) First-time home buyers and residential investment volatility. Federal Reserve Bank of Chicago, 2007. Working paper 2007–15. https://www.chicagofed.org/publications/working-papers/2007/wp-15. Accessed 22 Mar 2013
Forbes C, Evans M, Hastings N, Peacock B (2010) Beta distribution, in statistical distributions, 4th edn. Wiley, Hoboken. doi:10.1002/9780470627242.ch8
Green RK (1997) Follow the leader: how changes in residential and non-residential investment predict changes in GDP. Real Estate Econ 25:253–270
Harvey DI, Leybourne SJ, Newbold P (1997) Testing the equality of prediction mean squared errors. Int J Forecast 13:281–291
Hendry DF (2006) Robustifying forecasts from equilibrium-correction systems. J Econom 135:399–426
King RG, Watson MW (2012) Inflation and labour costs. http://www.bu.edu/econ/files/2010/11/King_Watson_GERZ25_Jan26_201211.pdf. Accessed 10 Jan 2013
Kohn DL (2009) The economic outlook. Speech at the National Association for Business Economics, St. Louis, Missouri. http://www.federalreserve.gov/newsevents/speech/kohn20091013a.htm. Accessed 15 Apr 2013
Koop G, Korobilis D (2011) UK macroeconomic forecasting with many predictors: which models forecast best and when do they do so? Econ Model 28:2307–2318
Koop G, Potter S (2004) Forecasting in dynamic factor models using Bayesian model averaging. Econom J 7:550–565
Korobilis B (2013a) Bayesian forecasting with highly correlated predictors. Econ Lett 118:148–150
Korobilis B (2013b) Hierarchical shrinkage priors for dynamic regressions with many predictors. Int J Forecast 29:43–59
Krainer J (2006) Residential investment over the real estate cycle. Federal Reserve Bank of San Francisco Economic Letter, Number 2006–15, June 30. http://www.frbsf.org/economic-research/publications/economic-letter/2006/june/residential-investment-over-the-real-estate-cycle/. Accessed 28 Mar 2013
Leamer EE (2007) Housing is the business cycle. Fed Bank KS, pp 149–232. https://www.kansascityfed.org/publicat/sympos/2007/pdf/leamer_0415.pdf. Accessed 30 Mar 2013
Lunsford K (2015) Forecasting residential investment in the United States. Int J Forecast 31:276–285
MacLehose RF, Dunson DB, Herring AH, Hoppin JA (2007) Bayesian methods for highly correlated exposure data. Epidemiology 18:199–207
McCarthy J, Peach RW (2002) Monetary policy transmission to residential investment. FRBNY Economic Policy Review, May, pp 139–158
McCracken MW (2004) Asymptotics for out of sample tests of granger causality. Manuscript, University of Missouri, Columbia. ftp://ftp.soc.uoc.gr/students/aslanidis/Nektarios/Andrea/McCracken%20(2004).pdf. Accessed 10 Jan 2013
McCracken MW (2007) Asymptotics for out-of-sample tests of Granger causality. J Econom 140:719–752
Mitchell TJ, Beauchamp JJ (1988) Bayesian variable selection in linear regression. J Am Stat Assoc 83:1023–1032
Peek J, Wilcox J (2006) Housing, credit constraints, and macro stability: the secondary mortgage market and reduced cyclicality of residential investment. Am Econ Rev 96:135–140
Rapach DE, Wohar ME, Rangvid J (2005) Macro variables and international stock return predictability. Int J Forecast 21:137–166
Rapach DE, Wohar ME (2006) In-sample versus out-of-sample tests of stock return predictability in the context of data mining. J Empir Financ 13:231–247
SAS Institute (2012) Gamma and inverse-gamma distributions. SAS/STAT(R) 12.1 User’s Guide. SAS Institute Inc. http://support.sas.com/documentation/cdl/en/statug/65328/PDF/default/statug.pdf. Accessed 12 Apr 2013
Stock J, Watson M (2002) Macroeconomic forecasting using diffusion indexes. J Bus Econ Stat 20:147–162
Shotwell MS, Slatey EH (2011) Bayesian outlier detection with Dirichlet Process mixtures. Bayesian Anal 6(4):665–690
Yellen JL (2013) A painfully slow recovery for America’s workers: causes, implications, and the federal reserve’s response. Speech at the “A Trans-Atlantic agenda for shared prosperity” conference, Washington, D.C., USA, 11 February, 2013. http://www.federalreserve.gov/newsevents/speech/yellen20130211a.pdf. Accessed 15 Apr 2013
Author information
Authors and Affiliations
Corresponding author
Additional information
We would like to thank two anonymous referees and an Associate Editor for many helpful comments. However, any remaining errors are solely ours.
Appendix
Rights and permissions
About this article
Cite this article
Aye, G.C., Miller, S.M., Gupta, R. et al. Forecasting US real private residential fixed investment using a large number of predictors. Empir Econ 51, 1557–1580 (2016). https://doi.org/10.1007/s00181-015-1059-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00181-015-1059-z
Keywords
- Private residential investment
- Predictive regressions
- Factor-augmented models
- Bayesian shrinkage
- Forecasting
JEL Classification
- C32
- E22
- E27