1 Introduction

According to World Health Organization (2018), affected by climate change, including rising temperature, rainfall variation and increasing frequency and intensity of extreme events, the global hunger situation is deteriorating and returning to the level of a decade ago after ten years of steady improvement. The total number of hungry people in the world has reached 821 million (i.e., one in every nine people in the world is hungry) of which the Asian population accounts for 63% (WHO, 2018). Conceição (2019) and Chen et al. (2020) argued that the largest crop yield decline caused by climate change will occur where food insecurity is already a threat, many of which belong to developing countries. Because they are located in tropical/subtropical regions, where climate signals will be faster and easier emerged from the "noise" (King & Harrington, 2018). Most results also showed that climate change has caused damage to crop yields in low and middle latitude regions (Miao et al., 2016; Schlenker & Roberts, 2009), while it is predicted that demand for grain is expected to increase by approximately 46% by 2050 (Keating et al., 2014).

Nearly half of the world's population regard rice as the main source of calorie intake, and the high water consumption of rice growth is more susceptible to the effects of climate change, which have caused widespread concern for rice. As the world's largest rice producer, China has an average annual total output of 22% of the world's rice production (FAO, 2017). Due to the decline in production brought about by climate change and the consideration of related trade benefits, it is not surprising that China's rice production has attracted the attention of the world, especially for Asian countries where rice is the staple food. But climate change is destroying the production of major crops such as rice in tropical and temperate regions (Shrestha et al., 2016). The stagnation of rice yield has been observed in Southern China (Peng et al., 2009; Wei et al., 2017). Fortunately, the higher temperature induced by global warming may also make large-scale crop production possible in high-latitude regions where crops were rarely cultivated previously (Du et al., 2017). Meanwhile, in order to prevent the chain effect of rising food prices and loss of income due to food shortages that could further reducing people's access to food, the government proposed minimum support price policy on rice. Among them, the Northeast China are the regions implementing price policies, which further promotes its steady growth and guarantees food security. The national bureau of statistics data from China shows that the average yield of rice in high-latitude regions increased by 30% from 1992 to 2018, of which 73.5% of the regions have a growth rate of more than 15%, while rice yield in Southern China tends to stagnate over past 20 years. And yield growth shows a trend of spreading from south to north (See Fig. 1). Therefore, it is meaningful to investigate rice production in high-latitude regions under climate change for future food supply and food security.

Fig. 1
figure 1

Average annual yields of rice (kg/mu) from 1992 to 1997 (a), 1998 to 2004 (b), 2005 to 2010 (c), 2011 to 2018 (d); 1 mu = 0.1647369 acre; 1 kg/mu = 6.07028541 kg/acre

The literature on estimating yield response to climate has a long history in agricultural economics, and they found that climate variables significantly contribute to crop yield and its variability (Schlenker & Roberts, 2009). And recent studies have started to continuously estimate the impact of climate and economic factors (Miao et al., 2016; Weersink et al., 2010). The main consideration is that ignoring the price factors leads to overestimates of adverse impacts of climate change. Different types of proxies for expected prices were adopted, such as lagged-year received prices and current-year futures prices (Lin & Dismukes, 2007). However, as Nerlove (1956) noted, farmers react not only to last year's price, but also to the price they expected that depends on the last year's price with a limited extent. And Clark and Fleming (1990) and Yang et al. (2015) argued that price support policy issued by the government that sets the price floors for crops would alter farmers’ price expectations. But in the literature on price policy in China, the main method of incorporating the price policy is to construct dummy variables for estimation (Zhang et al., 2020), which might be rough as it ignores the impact of floor prices on farmers’ price expectations.

Empirically, regression models have been used with panel data to examine the response of crop yields to climate variables. The first classical model is the least square dummy variable (LSDV) or fixed effects model. These models consider the heterogeneity of individual and time. For example, Lu et al. (2018) investigated how the crop yield respond to climate variables on dryland and irrigated land by exploiting a fixed effects panel model. Another more widely employed model is the stochastic production function model. Compared with the fixed effects model, a stochastic production function better captures the production uncertainty which appeared as heteroscedasticity in the regression model (Just & Pope, 1979). Thus, tests against heteroscedasticity will always be necessary. Previous studies have widely accounted for heteroscedasticity using the stochastic production function (e.g., Weersink et al., 2010). And limited studies used the spatial error model (SEM) to model spatial autocorrelation of the error term (Chen et al., 2016; Miao et al., 2016). However, neither the spatial correlation of the error term nor the applicability of models has been tested in dealing with heteroscedasticity. This might lead to a biased and inefficient estimation (Beenstock & Felsenstein, 2019; LeSage, 1997).

The purpose of this paper is threefold. First, we seek to examine the climate and price drives of rice yield in high-latitude regions with the case of Northeast China. We absorb the monthly accumulated growing degree days enter into the model which is a more comprehensive method than previous studies. For example, most studies used the total growing degree days to reflect the effects of accumulated heat (e.g., Lu et al., 2018; Miao et al., 2016), but they could not figure out the difference of heat effects in different growth stages. We also construct expected prices based on the incorporation of price support policy and other important price information in market, and take Kalman filter method to test whether the new information in market is useful in forming farmers’ expectations. Second, the heteroscedasticity and spatial correlation of the econometric model are considered simultaneously. We use two different Breusch-Pagan tests to investigate the applicability of the typical panel model and develop a spatial autoregressive combined model (SAC) to address the above issue. Third, we further predict the variability of rice yield under future climate scenarios with “lower” and “higher” rates of global warming and the contribution of various climate variables in the medium term and long term by using the latest climate change predictions of the Intergovernmental Panel on Climate Change (IPCC)'s Sixth Assessment Report (AR6) as of 2019.

The next section presents the empirical strategy, including research area and data, and econometrics model used in the analysis. The penultimate section contains estimation results and predictions, with the final section giving a discussion of conclusions.

2 Study area and methods

2.1 Study area and data

We select three northeast provinces of China (Heilongjiang, Jilin and Liaoning) as our study area. These are the coldest crop regions in China. Due to being located at higher latitudes, winters in northeastern regions are cold and long, while summers are cool and humid. However, global warming has increased the average efficient accumulated heat in rice growing season in high-latitude regions of China by 215.62℃, from 1249.39℃ in 1992 to 1465.01 ℃ in 2018. Coupled with the technological production progress, these regions, which previously cultivated less rice, are starting to carry out large-scale production of rice, a thermophilic crop.

Rice production in high-latitude regions is playing an increasingly important role in China. Over the past 27 years, the total production of rice in the northeast provinces has grown rapidly. It has increased by 3.43 times from 10.92 million tons to 37.50 million tons (China Agriculture Statistical Report, 2019). One of the most outstanding performances occurred in Heilongjiang province. The rice production of this province in 1992 was 3.77 million tons, which accounted for 0.02% of China’s rice production, but by 2018, it had increased by 7.13 times to 26.86 million tons, which is 12.66% of the total national output.

The rice yield in high-latitude regions has also experienced a rapid increase over the past 27 years, and districts with a high yield have also gradually spread from the south to the north (Fig.1). The average yield increased from 403.16 kg/mu to 487.76 kg/mu with an average net increase of 84.60 kg per mu. The minimum rice yield has increased from the initial 262 kg/mu to 382.73 kg/mu, with an increase of 46.1%. In the meantime, the yield in the northernmost region has also experienced rapid growth. From Fig.1, we can see that the northernmost region, from its initial yield of less than 387.0 kg/mu, has increased to a yield range of 431.1 to 500.0 kg/mu.

The annual city-level rice production, rice area, total sown area and fertilizer usage were obtained from the Provincial Statistical Yearbook of China (Heilongjiang, Jilin, Liaoning) from 1993 to 2019, which covers 36 cities. However, we excluded the Greater Khingan Range and Chaoyang because the production of rice in these two areas was zero in most years. Rice yields are derived by the city-level rice production divided by the corresponding rice area. The fertilization intensity is calculated by dividing the total fertilizer usage by the total sown area. We collected the received prices for rice (PRt) and corn (PCt) from the China Agricultural Products Price Yearbook. The variables in expected price model including international future price (\({PR}_{t}^{IF}\) for rice and \({PC}_{t}^{IF}\) for corn) and imported spot prices (\({PR}_{t}^{IS}\) for rice and \({PC}_{t}^{IS}\) for corn) were obtained from Chicago Board of Trade (https://www.cmegroup.com/company/cbot.html) and UN Comtrade database (https://comtrade.un.org), respectively. The annual rice minimum purchase price (MPt) and domestic corn future price (\({PC}_{t}^{DF}\)) were collected from China National Development and Reform Commission (CNDRC) and Wind database (https://www.wind.com.cn), respectively. The daily historical climate data were collected from the China Surface Climate Database based on values from weather stations located within the crop districts.

2.2 Empirical model

The general function of climate-yield model is to regress the crop yield \({Y}_{i,t}\) against the climate variables \({X}_{i,t}\) in region i for year t:

$${Y}_{i,t}=f\left({X}_{i,t},\beta \right)+{u}_{i,t}$$
(1)

To address the heteroscedasticity, previous literature on climate change mainly adopted two methods to construct the disturbance \({u}_{i,t}\).

First, the error term can be predicted by the regressors. Just and Pope (1979) accounted for heteroscedasticity in the model by employing a stochastic production function. The error term (\({u}_{i,t}\)) can be assumed to beFootnote 1:

$${{u}_{i,t}=h({X}_{i,t})}^\frac{1}{2}{u}_{i,t}$$
(2)

where \({h({X}_{i,t})}^\frac{1}{2}\) is a function of the matrix of variables included in the regressors of \({X}_{i,t}\). The estimation of \({h({X}_{i,t})}^\frac{1}{2}\) is typically undertaken as a three-step procedure called the feasible generalized least squares (FGLS). In the first stage, the consistent estimator of \({u}_{i,t}\) is obtained by regressing \({Y}_{i,t}\) on \(f\left({X}_{i,t},\beta \right),\) and the residuals are then logged and squared. Another OLS is undertaken to regress the logged and squared residuals on \({X}_{i,t}\) to estimate the variance source of the stochastic production function in the second stage. The third stage takes the square root of the predictions of the second stage to determine \({h({X}_{i,t})}^\frac{1}{2}\).

Second, the error term is spatially correlated, and heteroscedasticity appears across space. Regions located near to each other might have similar practices or input factors, like land quality and climate conditions, which are spatially correlated and cannot be observed easily (Chen et al., 2016; Miao et al., 2016). This could lead to spatial correlation of the error term. The error process can be modeled by spatially autoregressive process using spatial panel econometrics:

$${u}_{i,t}=\theta W{u}_{i,t}+{e}_{i,t}$$
(3)

where \(W\) is \(N\times N\) spatial weight matrix that does not change with time, and \(N\) is the number of districts. \({e}_{i,t}\) is an error term that is independent and identically distributed.

In addition, the technology diffusion of the crop yield causes another spatial-connection process (Kostov, 2010). Thus, we further apply a spatial weight matrix on the dependent variable that represents the technology diffusion effects between neighbors. The Lagrange multiplier test for spatial lag significantly rejects the null hypothesis (See Table 3), which also proves the rationality of taking spatial autocorrelation of the dependent variable into account. Our yield model is as follows:

$$\begin{aligned}{Y}_{i,t}=&{\rho W}_{1}{Y}_{i,t}+{\beta }_{0}+{P}_{i,t}{\beta }_{1}+{S}_{i,t}{\beta }_{2}+{Z}_{i,t}{\beta }_{3}\\&+{\Gamma }_{i,t}{\beta }_{4}+{\lambda }_{i}+\theta {W}_{2}{u}_{i,t}+{e}_{i,t}\end{aligned}$$
(4)

where \({W}_{1}\) and \({W}_{2}\) are an \(N\times N\) spatial weight matrix that does not change with time. \({Y}_{i,t}\) is the yield in district \(i\) and year \(t\), \({P}_{i,t}\) is a vector of expected output prices, \({S}_{i,t}\) denotes the fertilizer application intensity proxied by the amount of fertilizer application and \({Z}_{i,t}\) is a vector of climate factors. The time trend vector \(\Gamma\) includes linear and quadratic forms of the time trend to capture the advance in technology and the improvement in agronomic practices. \({\lambda }_{i}\) is a city-level fixed effect to model the unobserved heterogeneity of districts. \({u}_{i,t}\) is the disturbance of function and \({e}_{i,t}\) is an independent and identically distributed error term. We assume that \({Y}_{i,t}\), \({P}_{i,t}\) and \({S}_{i,t}\) in the specification of (4) take the logarithmic form.

To construct the spatial weight matric \({W}_{2}\), we define neighbors as four nearest districts to a district. But the spatial weight matrix \({W}_{1}\) is defined as three nearest districts to a district.Footnote 2 Because rice cultivation in China is spread from south to north (See Fig. 1), the technology diffusion effects of the yield do not include technology diffusion from north to south.

The model is constructed for the period from 1992 to 2018 for rice in the high-latitude regions of China crossing 34 districts. The yield variable is the rice yield in kilograms per mu of aggregate rice. Included in the expected output price vector is the expected rice price and expected price of the competitive crop corn. As Miao et al. (2016) and Xu et al. (2019) noted, increase in crop price could affect the crop yield by increasing farmers’ expectation on crop revenue, and promote their production management.\({S}_{i,t}\) is estimated by the amount of fertilizer applied per mu to denote the fertilizer application intensity, which is different from many studies in the existing literature that used fertilizer price (Haile et al., 2016; Miao et al., 2016). The price of chemical fertilizers in China is relatively stable, and the application of chemical fertilizers has not reached the optimal level. This is the reason why farmers are not sensitive to changes in fertilizer prices. Therefore, it is the intensity of fertilizer application rather than the fertilizer price that directly affects yield changes in China. \({Z}_{i,t}\) includes the monthly total precipitation and its squared terms, frozen degree days (FDD), the monthly accumulated growing degree days (GDD), the monthly temperature deviation and the sunshine, which all span rice growing season from May to September. We include these disaggregated climate variables to examine the potential impacts of timing and seasonal variation in climate variables on rice yield.

Specifically, the monthly average, minimum and maximum temperature, as well as monthly total precipitation for each city from May 1992 to September 2018 are calculated by daily climate data. The monthly temperature deviation is defined as the difference between the monthly maximum temperature and the monthly minimum temperature. And the sunshine variable refers to the total sunshine hours received during the rice growing period. We conclude that efficient GDDs for rice growth are in the range 10–35℃ following studies by Wang et al. (2014). GDDs below 0℃ are defined to be frozen degree days to reflect the impact of extremely low temperature on rice yield throughout the entire growing season. Due to the lower temperatures and less high-temperature weather in the three provinces of Northeast China, the negative effect of extreme weather on crops is mostly frost damage. Therefore, overheat degree days (ODD) are not included in our model. GDDs in the range of 10–35℃ and frozen degree days are calculated based on daily temperature data following the simplest fitted sine curve method illustrated by Baskerville and Emin (1969).

We next use two Breusch and Pagan tests to distinguish the typical heteroscedasticity and spatial correlation. From the perspective of econometrics, the statistic tests are needed to discuss the applicability of the Just–Pope model for the spatial panel data and the applicability of spatial panel models for solving typical heteroscedasticity. The first Breusch and Pagan test (Breusch & Pagan, 1979) is given by

$${CR}_{1}=NT{R}^{2}\sim {\chi }^{2}\left(K-1\right)$$
(5)

where \(N\) is the number of panels and \(T\) is the time series sample size per panel. \(K\) is the number of columns of regressors \(X\), and \({R}^{2}\) is the r-squared value coming from the second stage regression of the Just–Pope model.

Another Breusch and Pagan test (Breusch & Pagan, 1980) can be written as

$${CR}_{2}=T{\sum }_{i=1}^{N}{\sum }_{j\ne i+1}^{N}{r}_{ij}^{2} \sim {\chi }^{2}\left(\frac{1}{2}N(N-1)\right)$$
(6)

where \({r}_{ij}^{2}\) is the squared correlation coefficient between the \({i}^{th}\) and \({j}^{th}\) spatial panels in the data (Beenstock & Felsenstein, 2019), and \(N\) is the number of panels.

In this study, we explore the applicability of Just–Pope FGLS approach for spatially correlated panel, and the ability of spatial panel models to solve heteroscedasticity. Therefore, we first compare the conclusions regarding the \({CR}_{1}\) from the least square dummy variable (LSDV) model that is typically used in the literature with the conclusions regarding the \({CR}_{1}\) test assuming a spatial error structure or that both spatial error and spatial lag exist across spatial panels. We also compare the findings regarding the \({CR}_{2}\) test applied to the LSDV, Just–Pope FGLS, SEM, and SAC residuals to calculate the \({r}_{ij}^{2}\) of the \({CR}_{2}\) test given in Eq. (6).

2.3 Construction of price policy into expected price

After developing the yield model with expectations of output prices, we further discuss how to test the usefulness of price policy information in forming price expectation and then construct price expectation under the price policy. Since 2004, China has implemented a price support program “Minimum Purchase Price Policy (MPPP)” on rice in some provinces to protect farmers' interests and ensure the supply of rice markets, including Heilongjiang and Jilin. Liaoning has also joined the price support program since 2008. When the market price is lower than the minimum purchase price determined by the government, the government entrusts a grain enterprise that meets certain qualifications to purchase farmers' grain at the minimum purchase price, which is similar to the “buffer stock” policy in America or “initial payment” policy in Canada. And corn futures varieties were listed on the Dalian Commodity Exchange in China in 2004. Therefore, farmers who planted rice or corn have new information to predict their crop expected prices.Footnote 3 To ensure that these new information are useful to predict prices, we will take Kalman filter theory to test it. Sargent (1979) and Clark et al. (1992) argued that Kalman filter is a useful method to test whether the market change provides farmers with useful information. This suggests that the new estimator of \(P\) given old information set \(\Omega\) and new information \(Z\) is

$$f\left(P|\Omega ,Z\right)=f\left(P|\Omega \right)+f[(P-f(P|\Omega ))|(Z-f(Z|\Omega ))]$$
(7)

where \(f(\dots )\) represents projection function. The final term is the updated factor given \(Z\) and\(f\left[\left(P-f\left(P|\Omega \right)\right)|\left(Z-f\left(Z|\Omega \right)\right)\right]=f({u}_{P}|{u}_{Z})\), where \({u}_{P}\) is the forecast error of prices given the old information set \(\Omega\), and \({u}_{Z}\) is the forecast error of prices given the new information \(Z\). Therefore, to test whether the new information in market is useful is to regress \({u}_{P}\) on \({u}_{Z}\) and judge the significance of coefficient. The new information is helpful to predict prices if obtain the significant t-value and otherwise useless. Farmers with rational expectations will incorporate \(Z\) into their predictions of prices if the new information is useful to predict.

Generally, the expectation of price adopted by farmers is autoregressive (Clark et al., 1992), which can be easily estimated using ordinary least square method. And price policy and futures market might also provide additional information for farmers’ expectations (Haile et al., 2016; Lin & Dismukes, 2007). As minimum purchase rice price is a guaranteed price and is known with certainty by farmers before rice is planted, price policy could exist in two different situations, binding and not binding to the price floors. The expectation equation of rice price is under a constrain of \({P}_{t}\ge {MP}_{t}\). That is

$${P}_{t}={\beta }_{0}+{{\beta }_{1}(L)P}_{t-1}+{\beta }_{2}\left(L\right){X}_{t}+{u}_{t}, s.t. {P}_{t}\ge {MP}_{t}$$
(8)

where \({P}_{t}\) is the crop prices that farmers wish to predict in year \(t\), \({X}_{t}\) represents other information used by farmers to predict prices, mainly including international futures prices and imported spot prices in China (Lin & Dismukes, 2007; Xu et al., 2019), \({MP}_{t}\) is the minimum purchase price, \(L\) is the lag operator, \({\beta }_{0}\), \({\beta }_{1}\), \({\beta }_{2}\) are estimated coefficients, and \({u}_{t}\) is an error term. And typical autoregressive model might no longer represent the true distribution of rice price, as the probability that rice price can fall below the minimum purchase price is zero. This distribution is called truncated distribution. This means that farmers would care whether the actual price is higher than minimum purchase price, or how much higher the actual price is. Then an expected price equation with floor price would become

$${E}_{t-1}{P}_{t}=M{P}_{t}+{D}_{t}\left({\beta }_{0}+{{\beta }_{1}\left(L\right)P}_{t-1}+{\beta }_{2}\left(L\right){X}_{t}\right)+{\vartheta }_{t}$$
(9)

where \({E}_{t-1}{P}_{t}=E({P}_{t}|{\mathrm{Z}}_{t})\) is the farmer’s expectation on the price of next-term and \(Z\) is the new information set available to farmers when they generate the price expectations,\({D}_{t}\) is the expected price difference which is a function of \({P}_{t-1}\) and \({X}_{t}\). \({D}_{t}\left(\dots \right)\) have a truncated distribution of price at zero when price policy is binding, which can be estimated using a Tobit procedure. \({\vartheta }_{t}\) is equal to the probability density function of \({D}_{t}\) multiplied by the standard deviation of \({u}_{t}\) (Maddala, 1983).

3 Results and discussion

3.1 Model preparation

3.1.1 Crop price expectations

In the estimation procedure, a two-step method is developed by Clark et al. (1992). In the first step, rice price policy and other useful price information in market are incorporated into the expected price equations estimated by OLS and Tobit approach. In the second step, price expectations of the first step are entered into the yield model to estimate the response of rice yield to climate and price. Table 1 presents the OLS and Tobit equations for predicting rice and corn prices. We obtain price expectations for rice and corn by predicting prices based on the estimated equations.

Table 1 Price expectation equations for rice and corn

Clark et al. (1992) argued that a test of the importance of the floor price to predict harvested price could be estimated using Kalman filter, that is, regress the forecast error with the minimum purchase price on the forecast error without the minimum purchase price. A significant t-value in the resulting regression indicated that the minimum purchase price helps predict harvested price. The result in Table 2 shows that rice minimum purchase price and domestic corn future price both add useful information to the old information set, which is helpful in predicting the harvested price.

Table 2 The test of whether the new information is useful

3.1.2 Model diagnostic tests

We use Moran’s I test and Lagrange Multiplier (LM) test to examine spatial autocorrelation and choose the correct spatial model (Table 3). The results show that the value of Moran’s I statistic for rice yield in the most recent year is 0.55 and this is significant at the 1% level. We also use the LM spatial lag test and the LM spatial error test (Elhorst, 2014, pp.57–59) to test our spatial model. Both hypotheses are rejected, supporting our setting of the SAC model. The spatial correlation of the error term comes from the similarity of practices or production characteristics of cities, which has been proved by many scholars (Chen et al., 2016; Miao et al., 2016). Lin (1991) found that advanced planting methods or technology could flow into areas lacking in technology, thereby increasing the yield of neighbors, which shows the spatial correlation of the yield.

Table 3 Spatial model test

We further conduct two BP tests for four models, respectively, and the results are listed in Table 4. The Breusch and Pagan Chi-square statistics for spatial correlation are statistically significant in the LSDV model, Just–Pope FGLS model and SEM model but are not significant in the SAC model. Specifically, the results reveal that under the LSDV model, both the \({CR}_{1}\) and the \({CR}_{2}\) tests strongly indicate the presence of heteroscedasticity in the data, with the \({CR}_{1}\) test suggesting that heteroscedasticity is related to the regressors. This type of heteroscedasticity is no longer significant when allowing for spatial autocorrelation in the model. However, the results of the \({CR}_{2}\) test indicate that spatial dependence persists when the model is corrected for the type of heteroscedasticity considered by Just and Pope (1979) or when only spatial autocorrelation of the error term is allowed. Therefore, spatial correlation is a more important source of heteroscedasticity if the spatial nature of the data is considered. And the spatial error model could address the heteroscedasticity caused by the correlation of independent variables and the error term. It does not completely address the heteroscedasticity originating from the spatial dependence of the dependent variable, although part of the heteroscedasticity is derived from spatial autocorrelation of the error term (Beenstock & Felsenstein, 2019).

Table 4 Breusch and Pagan \({\chi }^{2}\) tests for heteroscedasticity and spatial correlation

3.2 Regression results

The estimation results for the rice yield response models (i.e., LSDV, FGLS, SEM, and SAC) are presented in Tables 5, 6 and 7.

Table 5 The results of economic and agronomic block
Table 6 The results of precipitation block
Table 7 The results of temperature block

As expected, spatial error and spatial autoregressive parameters are both positive and significant in the SAC model. The significantly positive spatial autoregression effect reveals that the yield of neighboring regions has a positive impact on the yield of this region. This supports our hypothesis on the significant technology diffusion effect of the yield. And fertilizer application intensity has a strongly positive spillover effect accounting for 28% of the total effect, which also partly confirmed that there is a technology diffusion effect between diffusion among neighboring cities (See Table 11 in the Appendix). An explanation for the strong indirect effect of fertilizer is that the increase in yield caused by fertilizer application can be easily observed by farmers in neighboring areas and extended into their own agricultural production practices. Therefore, compared to the studies by Miao et al. (2016) and Chen et al. (2016), we find that spatial autoregressive effect has a significant positive impact and cannot be ignored in our study area.

The rice yield responds positively to the expected rice price and negatively to the expected corn price, and both effects are significant in the four models. Corn, a crop competitor to rice, is planted between May and October, while rice is planted from May to September. Many farmers' fields are planted with corn and rice at the same time. As we can see in Table 5, the expected price elasticities of rice and corn on rice yield are 0.194 and -0.097, respectively. Combining the price expectation function and yield function, we could also find that the price support policy would have a positive effect on rice yield through positively affect the rice price expectation. And the rice expected price effects become smaller when considering spatial effects, which indicates that the spatial correlation is partly caused by the correlation of expected rice prices.

The results for linear and quadratic terms of the monthly precipitation are presented in Table 6. Throughout the growing season, precipitation in July and August matters for the rice yield. July and August are the booting-earing stage of rice (Xu et al., 2008). This period is the time during which rice has the most physiological water requirements in its lifetime, and it is also the period with the weakest drought resistance (Ye et al., 2015). A lack of water at this time will lead to a reduction in the grain number per spike, directly affecting the yield. For precipitation in May, July and August, the pattern of negative coefficients for the quadratic terms implies the presence of an inverse U-shaped relationship between rice yield and precipitation. This means that increasing precipitation is beneficial for rice yield before the precipitation reaches the threshold level. Beyond that level, the increase in precipitation will lead to a decline in rice yield. But the precipitation in May, the rice seedling-transplanting period, has minimal and insignificant impact on rice yield, mainly due to the low demand for precipitation during this growing period and the soil moisture accumulated before transplanting. In addition, June and September are the two months where the precipitation effect shows an U shape, where lower levels of precipitation decrease the yield and high levels of precipitation increase the yield. Within the current sample range, the impact of precipitation on the rice yield in June and September is found to be negative, although this result is not significant. The impact of precipitation is found to vary from month to month, probably because the role played by precipitation may be related to crop phenology and heat pressure.

The results of the accumulated heat response are listed in Table 7. Models basically find that the accumulated GDD in June and September is significant, which is partly consistent with previous studies (e.g., Lu et al., 2018; Miao et al., 2016). They pointed out that the total GDD had an inverse U-shaped relationship with yield, where the total GDD was equal to the accumulated GDD in September in our study. This argument indicated the influence mechanism of GDD, that is, crop yields respond positively with increased GDD until GDD reaches the threshold, but beyond that level, GDD will show the negative effects to yield. But it neglected the heterogeneity of the monthly impact of GDD. We find the importance of the monthly accumulated GDD. Specifically, accumulated GDD in June has a significant positive impact on the rice yield, while accumulated GDD in May and August show the negative effects and it’s not significant. The FDD is found to be harmful to the rice yield, but this is only significant for the Just–Pope FGLS model. And the sunshine variable has not been found to have a significant positive impact on the rice yield, which is related to the fact that rice is a short-day crop.

Our results also show that monthly temperature deviations in July to September are beneficial for rice yield, while that in May and June are harmful, although the estimated coefficient is not significant in June. The reason for the above pattern could be that larger temperature deviations in May and June demonstrate that weather is colder than usual at night and may cause frost damage to the crop. From July to September, however, a larger temperature deviation is more likely a symbol of warm weather, which is beneficial for rice growth. Similar to our results, Miao et al. (2016) pointed out that the timing of this temperature deviation is also important. However, Miao et al. (2016) argued that due to the existence of overheating, the larger the temperature deviation in the early stage of planting, the larger the yield, and the larger the temperature deviation in the later stage, the worser the yield. This is exactly the opposite of what is found in our study. An explanation for this is that our research area is relatively cold and frost damage is more prominent than overheating damage. This indicates that the negative effects of temperature deviations in May and June and the positive effects in the next three months of the growth period are from colder and warmer weather, respectively.

By combining the coefficients of precipitation and heat effect, we find that precipitation and accumulated heat have seasonal imbalanced effects during rice green-filling stage from June to August in the high-latitude regions. Table 8 shows the monthly and cumulative total effects of precipitation and growing degree days separately, where the estimation of the precipitation effects is based on the sample mean because of its quadratic term. We further compare the effects of monthly and cumulative precipitation and GDD on yield. We find that during rice green-filling period from June to August, the core stage of rice growth, precipitation is positive while GDD is negative, or precipitation is negative while GDD is positive. This indicates a seasonal imbalance between water and heat supply. For example, precipitation in August has a positive effect but GDD in August has a negative effect on yield. This means that the temperature in August is too high relative to the demand for rice growth, although the actual temperature in August may not have been that high. It can make the rice growing environment dry in the absence of precipitation, so the effect of GDD is negative. Meanwhile, the impact of precipitation is positive. This indicates that an increase of GDD does not necessarily have a positive impact on the growth of crops, as it also requires the cooperation of precipitation. The effects of GDD and precipitation on rice growth are both important and indispensable. Only when the two effects reach the optimal equilibrium point can they exert the greatest positive effect on the yield.

Table 8 Monthly and cumulative total effects of precipitation and GDD

3.3 Forecast of crop yield under different climate change scenarios

The empirical results presented in the previous sections show the impact of climate variables on rice yield. We further quantify the impact of future climate change on yields in high-latitude regions of China. In conducting this investigation, projections of future climate are required. We collected projections of future temperature and precipitation under different future climate scenarios from phase six of world climate research program (Coupled Model Intercomparison Project, CMIP6). This program covers daily climate information, including average temperature, maximum temperature, minimum temperature and precipitation. And there are eight pathways (SSP1-1.9, SSP1-2.6, SSP3-7.0, SSP2-4.5, SSP4-3.4, SSP4-6.0, SSP5-3.4OS and SSP5-8.5) to represent different global warming scenarios, among which SSP1-2.6, SSP2-4.5, SSP4-6.0 and SSP5-8.5 are the upgraded projections of RCP2.6, RCP4.5, RCP6.0 and RCP8.5 in CMIP5, respectively. In order to be consistent with previous studies and make it comparable, we select SSP1-2.6 and SSP5-8.5 based on the BCC-CSM2-MR climate model of CMIP6, which represent scenarios of “lower” and “higher” rates of global warming, respectively. And we define the periods of 2041–2060 and 2061–2080 as the medium term and long term, respectively.

Table 9 presents summary statistics of projected climate variables under “lower” and “higher” rates of global warming scenarios in the medium term and long term. The results show that accumulated GDD and FDD are become smaller on average, which indicates that the two climate scenarios are warmer. This finding is consistent with the estimation by Miao et al. (2016). At the same time, in general, both future scenarios have larger precipitation for each month compared with the benchmark, while monthly temperature deviations are smaller.

Table 9 Summary statistics of historical and projected climate variables
Table 10 The average effects of climate change on yield and its decomposition

We further predict the percentage change of rice yield relative to the sample mean from 1992 to 2018 for two climate scenarios in the medium term and long term by using the projected climate variable dataset discussed above and the estimated model. Rice yield will increase with a range from 0.095% to 1.769% on average, except for the case of lower-rate global warming in the medium term, which indicates that the high-latitude regions of China may be an area with the potential to expand rice cultivation in the face of sustained temperature rising. However, in the mode of higher-rate global warming, the rate of increase in yield will slow down in long term, which is consistent with the finding by Xie et al. (2020).

The effects of climate change on rice yield are further decomposed into GDD, precipitation, FDD and temperature deviation effects (Table 10). Overall, the results show that increase in GDD, precipitation and FDD mainly lead to decrease in rice yield, while temperature deviation shows positive effects. Specifically, the projected changes in GDD decrease the rice yield with a range from -4.504% to -6.534%, and long-term, higher-rate climate warming will make GDD exert a smaller negative effect in cold high-latitude regions. The influence of FDD is limited ranging from -1.295% to 0.055%, and in the case of long-term higher-rate climate warming, it will change from a negative impact to a positive impact at 0.055%. In addition, although precipitation has a negative impact, the magnitude is small, ranging from -0.722% to -1.869%. And this negative impact could be offset by the establishment and improvement of irrigation and drainage systems. Rice yield could gain benefit from the change of temperature deviation, no matter in the medium term or long term, ranging from 5.786% to 9.821%.

4 Conclusions

This study investigates the climatic and economic determinants of rice yield in the high-latitude regions of China from 1992 to 2018. As compared to previous studies, we incorporate rice price policy into the yield model at the expected price using a Tobit econometric procedure, and take Kalman filter theory to test whether the new information in market is useful in forming farmers’ expectations. We allow the expectation of prices and the monthly accumulated growing degree days to enter into model, which is more comprehensive than previous studies. Additionally, we explore the issues of spatial correlation and heteroscedasticity by using two different Breusch and Pagan tests. And the spatial autoregressive combined model is adopted to address the spatial autocorrelations and heteroscedasticity simultaneously, which has rarely been discussed in previous studies.

We conduct two Breusch and Pagan tests on three typical models—LSDV, FGLS, and SEM—and our SAC model. We find that spatial correlation is a more critical source of heteroscedasticity than the Just-Pope type heteroscedasticity when spatial effects are taken into account. Spatial correlation cannot be completely addressed by only allowing spatially autocorrelated error term since the possible technology diffusion in neighboring areas also makes the dependent variable yield spatially correlated.

Our results show significant climate and expected price effects on rice yield in the high-latitude crop regions of China. The influence of precipitation on rice yield in the high-latitude regions of China has a nonlinear relationship and matters in the middle of rice growing season. As for the monthly accumulated GDD, we identify the importance of the monthly GDD. In particular, the accumulated GDD in June has positive effects on the yield, a monthly factor that has been neglected by most researchers. We also find that the effect of monthly temperature deviations on rice yield differs across the months in which it occurs, with negative effects occurring in the early growing season (May and June) and positive effects occurring in July, August and September. The possible reason for this is the lower temperatures in May and June and the warmer temperatures in July, August and September.

The impact of precipitation on yield is related to the GDD when calculating the monthly and cumulative total effects based on the sample mean. There is a vital phenomenon in which the total effects of GDD and precipitation are opposite during rice green-filling period from June to August, revealing the issue that precipitation and accumulated heat have seasonal imbalance in the core stage of rice growth. But the effects of GDD and precipitation on rice growth are both important and indispensable. Only when the two reach the optimal equilibrium point can they exert the greatest positive effect on the yield. Additionally, the price elasticities of rice and corn are 0.194 and -0.097, respectively. We highlight the importance of constructing price policy into price expectation.

Using projections of future temperature and precipitation under different future climate scenarios in the medium term (2041–2060) and long term (2061–2080), we forecast that in addition to the case of lower-rate global warming in the medium term, rice yield will increase with a range from 0.095% to 1.769% on average, but the rate of increase in yield will slow down in the higher-rate global warming. After decomposing the effects of climate variables on rice yield, the results show that changes in GDD, FDD and precipitation mainly lead to a decline in rice yield, while temperature deviation still shows positive effects.

However, this paper is not without its limitations. Due to data limitations at the city level, we cannot consider the effects of agronomic factors, like irrigation conditions, mechanization level and plot conditions, in the empirical model, which can be further strengthened in future research. And further work could include crop area in the analysis to jointly investigate production identity.