1 Introduction

Chinese cities are categorized into four tiers, according to their size and economic influence. First-tier cities include Beijing, Shanghai, Guangzhou and Shenzhen, while second-tier cities mostly consist of provincial capitals such as Nanjing and Hangzhou. Third- and fourth-tier cities are smaller, often newly developed urban areas. The top-tier cities have experienced overheating in housing demand, and as a result, inflated house prices, while the lower tiers have suffered from a supply glut and a subsequent spate of unsold houses. Figure 1 shows the housing price index (normalized as one at the first time period) increases over time in first-tier cities versus the national average. As a result, the average house in Shanghai would take the average worker 41 years to pay off versus 21 years in London. Home-buyers in Shanghai, especially those whose incomes fall in the bottom 10 percent of homeowners, take on extreme financial burdens to buy homes. Their mortgages are typically more than 10 times their annual disposable income. In an effort to slow down the speculative purchasing of real estate, various policies have been implemented. For instance, Shanghai introduced a property tax in 2011 and increased the minimum down payment requirement in November 2016. COVID-19 also broke out in December 2019. This paper uses monthly data from March 2009 to December 2021 to assess the impacts of increased minimum down payment requirements and the pandemic on Shanghai’s housing price index. This paper is related to studies on the mortgage and housing market (e.g.,Duan et al. 2021; Ye et al. 2014; Ricks 2021).

Fig. 1
figure 1

Housing price index comparison among first-tier cities and national average

The traditional approach to measuring the treatment effects is to assume the observed data are either subject to treatment or not (e.g.,Rosenbaum and Rubin 1983; Abadie et al. 2010; Hsiao et al. 2012). However, our data cover the period of no treatment (before November 2016), under the treatment (November 2016 to December 2019), and under the treatment and pandemic (January 2020 to December 2021). It provides a unique possibility to investigate the impact of the pandemic and the impact of increased minimum down payment requirements on the Shanghai housing price index separately. In this paper, we are interested in measuring the treatment effects, as well as considering methods to disentangle the impact of different treatments. The analysis relies on the Hsiao et al. (2012) (HCW) panel nonparametric regression methodFootnote 1 and the time-series ARIMA model.

The HCW method has several appealing features that have been documented in the literature. Bai et al. (2014) suggested that the HCW method can be extended to non-stationary time-series data. Li and Bell (2017) showed the HCW method’s robustness to regression functional form misspecification. Because of these advantages, the HCW method has been popular in analyzing treatment effects. Du and Zhang (2015) used their method to study the home-purchase restriction that was first started in Beijing in 2010, as well as the pilot property tax program implemented in Shanghai and Chongqing in 2011. They concluded that the home-purchase restriction reduced the annual growth rate of housing prices in Beijing by 7.69%. The trial property tax program reduced the annual growth rate of housing prices in Chongqing by 2.52%, while this program had no significant effects on the housing prices in Shanghai. Ke and Hsiao (2021) examined the 76-day COVID-19 lockdown policy placed on Hubei Province, the epicenter of COVID-19, in the first quarter of 2020. They concluded that this lockdown policy reduced the GDP of Hubei Province by 37% in the first quarter of 2020, and the majority of the economy quickly recovered after the government lifted the lockdown policy in April 2020. Other applications include Ouyang and Peng (2015), Ke et al. (2017), Du et al. (2021), Gao et al. (2019), and Li and Long (2018).

We present the data description in Sect. 2. Section 3 discusses the econometric modeling methodology. Section 4 provides the empirical analysis. Concluding remarks are in Sect. 5.

2 Data description

We use monthly housing price indices of 68 major Chinese cities from March 2009 to December 2021 downloaded from the Chinese National Bureau of Statistics. The housing price index for each of the cities considered is normalized to 100 for the year 2015. The treatment unit is Shanghai. We exclude the first-tier cities, Beijing, Guangzhou and Shenzhen, from the control group because they have experienced different policy changes during our sample periods. After the exclusion, the control group contains 64 cities. Table 1 presents the list of all cities in our dataset.

Table 1 City list

Shanghai in November 2016 raised the minimum down payment rate requirement from 30% to 35% for primary residence and from 60% to 70% for investment property. We let T denote the total number of time periods in the dataset. Let \(T_0\) denote November 2016. Let \(T_1\) denote December 2019, the date COVID-19 broke out. Thus, for \(t=1,2,\cdots ,T_0\), observed data contain neither treatment nor pandemic. For \(t=T_0+1,T_0+2,\cdots ,T_1\), Shanghai was under treatment, but in the absence of the pandemic, while other units received no treatment, nor under the pandemic. For \(t=T_1+1,T_1+2,\cdots ,T\), all units are under pandemic, while Shanghai remained under increased minimum down payment requirement policy, but not for other units. Figure 2 shows the housing price indices of all 68 cities.

Fig. 2
figure 2

Housing price indices of 68 cities

3 Methodology

To measure the treatment effects, we need to know the outcomes with or without receiving the treatment. However, rarely one has information about both. In other words, the counterfactuals for the missing outcomes need to be constructed in order to estimate the treatment effects. There are numerous methods being suggested (e.g., Chapter12, Hsiao 2022). In this paper, we adopt the panel nonparametric regression method suggested by Hsiao et al. (2012) (HCW).

Let \((y_t,x_{1t},x_{2t},\cdots ,x_{Nt})'\!=\!(y_t,\varvec{x}_t')'\) be the \((N\!+\!1)\) random vectors, \(t\!=\!1,2,\cdots ,T\). If \(y_t\) and \(\varvec{x}_t\) are cross-correlated, we can decompose \(y_t\) as

$$\begin{aligned} y_t=E(y_t|\varvec{x}_t)+\eta _t, \end{aligned}$$
(3.1)

where by construction \(\eta _t\perp \varvec{x}_t\). Hence one may predict \(y_t\) by \(E(y_t|\varvec{x}_t)\). HCW suggest to approximate \(E(y_t|\varvec{x}_t)\) by a linear function of \(\varvec{x}_t\), \(\mu +\varvec{a}'\varvec{x}_t\). However, \((\mu ,\varvec{a}')\) is unknown. It needs to be estimated. To balance the closeness of within-sample fit and the accuracy of post-sample prediction, various methods are suggested. For instance, HCW suggest using model selection criteria such as AIC (Akaike 1973, 1974) or AICC (Hurvich and Tsai 1989), Li and Bell (2017) suggest using Lasso (Tibshirani 1996), and Shi and Huang (2021) suggest the forward step-wise procedure.

To predict \(y_t\) in the absence of higher down payment requirement and pandemic, we use data before \(T_0\). We select the relevant subset of \(\varvec{x}_t\), \(\varvec{x}_t^*\), following Shi and Huang (2021) forward step-wise procedure. We start from a null model which includes no unit in the control group. At each step, we select one unit from the candidate control units and add it to the model. The model with the selected unit has the best in-sample fit compared with models with other candidate control units. The in-sample goodness of fit is measured using mean squared errors (MSE),

$$\begin{aligned} \frac{1}{T_0}\sum _{t=1}^{T_0}(y_t-\hat{y}_t)^2, \end{aligned}$$
(3.2)

where \(\hat{y}_t\) is the estimate of \(E(y_t|\varvec{x}_t)\).

The forward step-wise selection generates a path of models from a model with only one control unit to a model with all of the candidate control units. To evaluate the performance of each model on the path, we use out-of-sample forecasting accuracy as the criterion. Suppose we use the first k months as a training set and forecast the next h months (\(k+1\)-th to \(k+h\)-th month). Denote \(\hat{y}_{t,k}^m\) as the counterfactual from models trained with the first k time periods of the dataset using m out of N control units,

$$\begin{aligned} \hat{y}_{t,k}^m=\hat{\beta }_0+\hat{\beta }_1 x_{1t}+\cdots +\hat{\beta }_m x_{mt}. \end{aligned}$$
(3.3)

We calculate the forecasting MSE as

$$\begin{aligned} \frac{1}{h}\sum _{t=k+1}^{k+h}(y_t-\hat{y}_{t,k}^m)^2 \end{aligned}$$
(3.4)

The model with the smallest forecasting MSE is selected as the best model out of all models on the path. In the following procedures, we denote \(\hat{y}_t\) as the counterfactual generated by the best model estimated with the pre-treatment data.

Under the assumption that treatment or the absence of treatment in the treatment units has no effects on the control units, the so-identified model can be used to predict the outcomes of the treatment unit in the absence of treatment for the post-treatment periods. Although COVID-19 broke out in December 2019, as long as the impact of COVID-19 does not affect the fundamental relations between the treatment unit and control units in the absence of treatment, Equation (3.3) can still be used to predict the outcomes in the absence of treatment under the pandemic because all units are under the pandemic. Since the observed \(y_t\) are the outcomes under the joint impact of the treatment and the pandemic, the treatment effects for the post-treatment periods can still be estimated as

$$\begin{aligned} \Delta _t^1=y_t-\hat{y}_t, \hbox { for } t=T_0+1,T_0+2,\cdots ,T, \end{aligned}$$
(3.5)

independent of whether the pandemic broke out or not in December 2019.

The observed housing price indices in the post-pandemic period represent the outcomes under both the pandemic and the increased minimum down payment requirement. They provide the possibility to separately identify the impact of the pandemic, provided that the counterfactuals in the absence of the pandemic and treatment can be constructed. One way to predict such outcomes is to use pre-treatment and pre-pandemic data \((1,2,\cdots ,T_0)\) to identify and estimate a univariate time-series model, say the Box and Jenkins (1976) type autoregressive integrated moving average model (ARIMA) for the units considered, then use the identified model to predict the outcomes for periods \(T_1+1, T_1+2,\cdots ,T\). However, the treatment started in November 2016 and the pandemic broke out in December 2019. The constructed pre-treatment and pre-pandemic time-series models would not make use of the most recent 37 time-series observations before the breakout of COIVD-19. Moreover, it is well known that a univariate (ARIMA) model in general predicts short-run outcomes well, but not necessarily so for long-run predictions. In our case, the prediction involves predicting 38, 39, \(\cdots \), periods ahead.

To increase the degrees of freedom and reduce the prediction horizon, we note that Eq. (3.3) generates predictions in the absence of treatment for post-pandemic periods because \(\varvec{x}_t\) are the outcomes under pandemic but in the absence of treatment. If we replace \(\varvec{x}_t\) by the outcomes in the absence of treatment and pandemic, denoted as \(\tilde{\varvec{x}}_t\), then

$$\begin{aligned} \tilde{y}_t=\hat{\beta }_0+\hat{\beta }_1 \tilde{x}_{1t}+\cdots +\hat{\beta }_m \tilde{x}_{mt}, \ \ t=T_1+1,T_1+2,\cdots ,T, \end{aligned}$$
(3.6)

can give us the counterfactual paths in the absence of treatment and pandemic. We note that \(\hat{y}_t\) for \(t=T_1+1,T_1+2,\cdots ,T\), are the predicted outcomes in the absence of treatment but under the pandemic, then the pandemic impact can be estimated as

$$\begin{aligned} \Delta _t^2=\hat{y}_t-\tilde{y}_t, \ \ t=T_1+1,T_1+2,\cdots ,T. \end{aligned}$$
(3.7)

Since COVID-19 started in December 2019, all control units before this date were not subject to treatment or pandemic, we suggest using Box and Jenkins (1976) identification, estimation and prediction method to predict \(\tilde{x}_{it}\) for \(t=T_1+1,T_1+2,\cdots ,T\), then use Eq. (3.6) to construct \(\tilde{y}_t\) for \(t=T_1+1,T_1+2,\cdots ,T\). The training dataset contains all pre-pandemic time periods, including the 37 time periods between the treatment and the pandemic. Therefore, the prediction involves predicting 1, 2, \(\cdots \), periods ahead.

Table 2 The parameter estimates of two HCW forecasting models
Fig. 3
figure 3

Counterfactual estimated by the HCW method with forecasting window of 12

Fig. 4
figure 4

Counterfactual estimated by the HCW method with forecasting window of 24

Table 3 The comparison between actual and HCW predicted values with forecasting window of 12
Table 4 The comparison between actual and HCW predicted values with forecasting window of 24
Fig. 5
figure 5

Counterfactual estimated by the synthetic control method

Fig. 6
figure 6

Counterfactual estimated by the HCW method for small-sized houses

Fig. 7
figure 7

Counterfactual estimated by the HCW method for medium-sized houses

Fig. 8
figure 8

Counterfactual estimated by the HCW method for large-sized houses

4 Empirical results

4.1 The treatment effect before the pandemic

We first consider the time periods before the pandemic of 2019. Let \(y_t\) be the treatment city’s housing price index at time period t, and \(x_{it}\) be the i-th control city’s housing price index at time period t. Denote the number of control cities as N. We use the Shi and Huang (2021) forward step-wise selection rule to select the model to generate counterfactual paths for \(\hat{y}_t\), \(t=T_0+1,T_0+2,\cdots ,T\). We use two sets of time windows to measure the out-of-sample forecasting windows, \(h=12\) or 24. For \(h=12\), we use \(k=40,50,60,70,80\) months as a training set and forecast the next 12 months (\(k+1\)-th to \(k+12\)-th month). Denote \(\hat{y}_{t,k}\) as the counterfactual from models trained with the first k time periods of the dataset. We calculate the forecasting MSE as

$$\begin{aligned} \sum _{k\in \{40,50,60,70,80\}}\frac{1}{12}\sum _{t=k+1}^{k+12}(y_t-\hat{y}_{t,k})^2 \end{aligned}$$
(4.1)

The model with the smallest forecasting MSE is selected as the best model out of all models on the path. The best model contains 12 cities: Changchun, Nanjing, Hefei, Xiamen, Baotou, Jinzhou, Yichang, Shaoguan, Beihai, Luzhou, Nanchong, and Kunming. For \(h=24\), we use the first \(k=40,50,60\) months as a training set and forecast the next 24 months (\(k+1\)-th to \(k+24\)-th month). We calculate the forecasting MSE as

$$\begin{aligned} \sum _{k\in \{40,50,60\}}\frac{1}{24}\sum _{t=k+1}^{k+24}(y_t-\hat{y}_{t,k})^2 \end{aligned}$$
(4.2)

With these settings, the best model contains 11 cities: Changchun, Nanjing, Hefei, Xiamen, Baotou, Jinzhou, Yichang, Shaoguan, Luzhou, Nanchong, and Kunming. These 11 cities are identical to the control cities selected based on Eq. (4.1) except that the city "Beihai" is removed. It is worth noting that the selected control units are unlikely to be affected by the policy spillover because they are all far away from Shanghai and not experiencing the overheating as Shanghai.

Table 5 The comparison between actual and HCW predicted values for small-sized houses
Table 6 The comparison between actual and HCW predicted values for medium-sized houses
Table 7 The Comparison between actual and HCW predicted values for large-sized houses

We will call the method using the 12 selected control cities based on Eq. (4.1) as Model 1, and the method using the 11 selected control cities based on Eq. (4.2) as Model 2. Table 2 provides the estimation results of Model 1 and Model 2. Figures 3 and 4 plot the counterfactual estimated by the models. As one can see there is hardly any difference in the predicted counterfactual between the two models. The estimated difference between the actual and predicted counterfactual paths are reported in Tables 3 and 4, respectively. The average treatment effect is \(-\)8.17% and \(-\)8.13% on the price index over 36 months, respectively, with hardly any noticeable difference. Tables 3 and 4 also report the prediction standard errors for the HCW predicted outcomes using the formulas for constructing prediction error variance as in Fujiki and Hsiao (2015). The standard errors indicate that the differences between the actual and predicted values are significant at the 5% level.

Fig. 9
figure 9

Counterfactual estimated by the TS method with forecasting window of 12

Table 8 The comparison between actual and TS predicted values with forecasting window of 12
Fig. 10
figure 10

Counterfactual estimated by the TS method with forecasting window of 24

Table 9 The Comparison between actual and TS predicted values with forecasting window of 24
Table 10 Estimation and diagnosis results of ARIMA models

4.2 Comparison with the synthetic control method

For comparison purposes, in addition to the HCW method, we also estimate the treatment effects before the pandemic using the synthetic control method (SCM) following Abadie and Gardeazabal (2003) and Abadie et al. (2010). Both the HCW and SCM are non-causal, nonparametric methods to construct counterfactuals. However, the HCW method places no restrictions on the coefficients of predictors while SCM restricts the coefficients to be nonnegative and sum to 1. Moreover, the HCW method allows the presence of intercept while the SCM does not. Constructing counterfactuals is essentially an issue of prediction. In principle, whatever variables are not affected by the treatment but help prediction should be included and there should be no restrictions on the sign and value of the coefficients (Hsiao and Wan 2014). Moreover, predictive models including intercepts typically yield more accurate predictions than predictive models not allowing intercepts, not to mention that intercepts in the predictive models can be considered as capturing the unit-specific effects (Hsiao et al. 2012). Wan et al. (2018) have conducted Monte Carlo studies to compare the accuracy of predictions by the HCW and SCM methods in a number of data generating processes and found that in general HCW method yields more accurate predictions. This is also the case here.

From the counterfactuals constructed by the synthetic control method reported in Fig. 5, we observed that it does not track the treatment unit very well in the pre-treatment periods. The forecasting MSE using a forecasting window of 12 as described in Eq. (4.1) is 2440.5 for the synthetic control method, and 263.4 for the HCW method. Given its unsatisfactory performance in forecasting MSE, the estimation of the treatment effects does not seem reliable. The SCM fits the in-sample data (for \(t\le T_0\)) poorly because of two reasons: (i) The SCM fitted curve (for \(t\le T_0\)) has a sample mean above the real data. Since the SCM does not allow for an intercept, it cannot move the estimated curve downward to improve the fit. (ii) The SCM fitted curve has a flatter slope than the real data. Therefore, the parallel trend assumption (during the pre-treatment periods) is violated. This is because the SCM weights are restricted to sum to one. Since all control cities’ price curves have flatter upward trends, no SCM curve can match the upward trend of Shanghai’s price index. Due to these two reasons, the SCM counterfactual is biased toward finding zero treatment effects.

4.3 Heterogeneous treatment effects on house size

The increased down payment rate suppresses the overall demand in the housing market. As suggested by a referee, there could be demand shifting among market segments. To explore heterogeneous treatment effects on house size, we obtain the segment-wise housing price indices for small (area \(<90\text {m}^2\)), medium (\(90\text {m}^2<\) area \(<144\text {m}^2\)) and large (area \(>144\text {m}^2\)) houses. These price indices started in December 2010. Therefore, the time series is slightly shorter than the overall housing price index. We redo the estimation for the three housing market segments. The corresponding results are reported in Figs. 6, 7 and 8 and Tables 5, 6 and 7. The average treatment effects for small, medium and large houses are \(-\)5.68%, \(-\)9.16%, and \(-\)17.04%, respectively. For the small house price index, during the first 13 months after the treatment, the estimated treatment effects are mostly insignificant with mixed signs. We observe a similar phenomenon for the medium house price index but it lasts much shorter, for 3 months. After these time periods, the estimated treatment effects are negative and mostly significant at the 5% level for both the small and medium house price indices. For the large house price index, the estimated treatment effects are negative and significant from the first month after the treatment. The magnitude of the treatments is much larger than that of small and medium houses. These empirical results suggest that there is demand shifting from larger houses to smaller houses due to the increased down payment rate.

4.4 Separating the treatment effect and the pandemic effect

For the pandemic time periods, the control cities are affected by the pandemic. Using the HCW predicted \(\hat{y}_t\) obtained by Eq. (3.3) reflects the counterfactual of Shanghai housing price index without the treatment but under the pandemic. To predict the counterfactuals without the pandemic and without the increase in the minimum down payment requirement, as well as to reduce the potential distortions due to ignoring the time-series information from November 2016 to December 2019, we opt to construct ARIMA models for each control unit using data from pre-December 2019, and then use the estimated models to generate predictions for the control units for the post-pandemic periods, which reflect the scenarios without pandemic and without treatment. Denote each of such predicted values by \(\tilde{x}_{it}\). The ARIMA models for each \(\tilde{x}_{it}\) for the used control units are reported in the appendix. We use \(\tilde{x}_{it}\) to construct the counterfactual for Shanghai in the absence of pandemic and treatment using Eq. (3.6). Then the impact of the pandemic is estimated as

$$\begin{aligned} \Delta _t^2=\hat{y}_t-\tilde{y}_t, \hbox { for } t=T_1+1,T_1+2,\cdots ,T, \end{aligned}$$
(4.3)

and the impact of the increased minimum down payment requirement under the pandemic is

$$\begin{aligned} \Delta _t^1=y_t-\hat{y}_t, \hbox { for } t=T_1+1,T_1+2,\cdots ,T. \end{aligned}$$
(4.4)

For the control group selected by the forward step-wise selection with a forecasting window of 12, Fig. 9 shows the paths of \(\hat{y}_t\) (HCW counterfactual) and \(\tilde{y}_t\) (TS counterfactual), and Table 8 shows the corresponding month-by-month values. Figure 10 and Table 9 report the results for the control group selected by the forward step-wise selection with a forecasting window of 24. The difference between the counterfactuals predicted by TS and HCW indicates that the pandemic has a negligible impact on Shanghai’s housing price index. On the other hand, by comparing the difference between Actual and HCW after December 2019, we observe a lasting impact of increased minimum down payment requirement. It reduces the Shanghai housing price index by about 8%.

5 Concluding remarks

Our empirical analysis showed that raising the minimum down payment requirement appeared to be an effective policy to reduce the growth of Shanghai housing prices. The policy reduced its housing price index by about 8% had such a policy not been implemented. Moreover, the effects appeared to last as long as such a policy is in place. On the other hand, the pandemic hardly had any impact on the real estate prices in Shanghai, contrary to the findings in the real sector, where Ke and Hsiao (2022) find the pandemic reduces China’s outputs by about 10% because of the break out of pandemic.