The econometric framework we adopt is the general class of Gaussian linear state space models, which we estimate by Maximum Likelihood implemented via the diffuse Kalman filter (Harvey 1989; Durbin and Koopman 2012). These models, also known as Structural Time Series Models, have been employed as forecasting devices in agricultural economics, in climatic sciences mainly in the guise of the Ensemble Kalman filter used to discretize the partial differential equations used in geophysical models (Hannart et al. 2016), and, closer to our approach, also as a way to represent temperature trends (Mills 2010) and trends in the impact of weather-related disasters (Visser et al. 2014). This approach enables us to perform an explicit, rigorous and general analysis of the trend components in crop yield, while capturing the potential role of explanatory variables, the impact of which is allowed to vary across time as coefficients are potentially stochastic.Footnote 1
We perform our empirical analysis in two distinct stages, each devised to tackle a different research question. In the first stage, we examine the nature of the trend in crop yield and we explore the evidence on the existence of sudden permanent changes, i.e. structural breaks, in the slope of the trend. In the second stage, we investigate whether the trend in crop yield is influenced by weather variables and provide a quantitative measure of their impact. We assess the validity of the estimated models by running standard diagnostic checks on the residuals of the Kalman filter, that is the Ljung-Box test for serial correlation, the Jarque-Bera test for normality, and an F-test for heteroskedasticity that compares the variance of the first and last third of the observations in the sample (see Durbin and Koopman 2012). Throughout the estimation, we include pulse dummies to treat occasional outliers identified through the plot of the auxiliary observation residuals. In the following, we describe in more detail each of the two stages of the analysis.
Stage 1
The aim of the first stage is to assess the nature of the trend and any evidence on the existence of a structural break in its slope. With this aim, we estimate a set of local linear trend models in which the level and the slope are potentially stochastic. The general univariate model for crop yield, i.e. without any explanatory variable, is defined as follows.
$$ \left\{\begin{array}{cc}{y}_t={\mu}_t+{\varepsilon}_t& {\varepsilon}_t\sim N\left(0,{\sigma}_{\varepsilon}^2\right)\\ {}{\mu}_{t+1}={\mu}_t+{v}_t+{\eta}_t& {\eta}_t\sim N\left(0,{\sigma}_{\eta}^2\right)\\ {}{v}_{t+1}={v}_t+{\xi}_t& {\xi}_t\sim N\left(0,{\sigma}_{\xi}^2\right)\end{array}\right. $$
(1)
where the first line defines the crop yield yt, while the remaining two equations define the two states of the trend, the level μt and the slope vt, and where all error terms are assumed to follow independent Gaussian distributions. We assess the evidence for a structural break in the long-run yield trend in two ways: we look for outliers in the plot of the slope auxiliary residuals; and we evaluate the additional fit of modelling a structural break in the slope via information criteria.
In the first case, we estimate the model above assuming a deterministic level and a stochastic slope, i.e. (\( {\sigma}_{\eta}^2=0 \), \( {\sigma}_{\xi}^2\ne 0 \)), and we examine the resulting auxiliary residuals of the slope equation. Following standard practice in the literature, the presence of any outliers exceeding the 95% confidence band in the slope auxiliary residuals is interpreted as a sign of a structural break in the slope (Durbin and Koopman 2012).
In the second case, we assess whether a model that explicitly includes a structural break in the slope provides substantial gains in fit compared with a model without this component. The model we consider is as follows.
$$ \left\{\begin{array}{cc}{y}_t={\mu}_t+{\varepsilon}_t& {\varepsilon}_t\sim N\left(0,{\sigma}_{\varepsilon}^2\right)\\ {}{\mu}_{t+1}={\mu}_t+{v}_t+{\eta}_t& {\eta}_t\sim N\left(0,{\sigma}_{\eta}^2\right)\\ {}{v}_{t+1}={v}_t+{\delta}_t{d}_t+{\xi}_t& {\xi}_t\sim N\left(0,{\sigma}_{\xi}^2\right)\\ {}{\delta}_{t+1}={\delta}_t& \end{array}\right. $$
(2)
which includes a pulse dummy in the slope equation, that is dt = 1(t = t∗), where 1(∙) is an indicator taking the value of 1 if the event in brackets occurs and 0 otherwise, with δt being a deterministic coefficient. As the conclusions on the existence of a break might vary across different assumptions on the nature of the trend—being either deterministic or stochastic—we consider all four possible specifications, that is (1) both level and slope deterministic (\( {\sigma}_{\eta}^2=0 \), \( {\sigma}_{\xi}^2=0 \)); (2) stochastic level and deterministic slope (\( {\sigma}_{\eta}^2\ne 0 \), \( {\sigma}_{\xi}^2=0 \)); (3) deterministic level and stochastic slope (\( {\sigma}_{\eta}^2=0 \), \( {\sigma}_{\xi}^2\ne 0 \)); and (4) both level and slope stochastic (\( {\sigma}_{\eta}^2\ne 0 \), \( {\sigma}_{\xi}^2\ne 0 \)). As we do not have strong a priori beliefs on the timing of the break, we treat the break date as an unknown parameter, which is endogenously determined by the data. To this aim, for each of the four models above, we set t∗ equal to one specific year, calculate the corresponding likelihood and repeat this calculation for all possible years, excluding 10% of the observations at the beginning and at the end of the sample to ensure parameter identification in each subsample. The break date is finally estimated as the year producing the model with the largest likelihood.Footnote 2 Once the break date for each model has been estimated, we perform a model selection by applying information criteria over a total of eight models, that is four specifications with a structural break in the slope, as in Eq. (2), and four without, as in Eq. (1). The corresponding loss functions are defined as follows.
$$ IC=\frac{1}{T}\left[- logL+k(T)\left(2+n+b\right)\right] $$
(3)
where n is the number of stochastic states, b = 3 in models with a break and b = 0 in models without a break and k(T) a function that reflects two different ways in which the number of parameters are penalized, i.e. using the Schwartz criterion from Yao (1988), where k(T) = logT, and using the criterion of Liu et al. (1998), where k(T) = 0.299logT2.1.Footnote 3
Application of the information criterion above enables us to simultaneously determine the nature of the trend (by choosing one of the four specifications) and the existence of a structural break in the slope. The stochastic features of the trend selected by the information criterion are examined in light of three sources of evidenceFootnote 4: the size of the Maximum Likelihood (ML) estimate of the state error variance, along with the corresponding t-statistic, which would give an indication on whether the stochasticity is sizeable or negligible; the plot of the smoothed state to verify the presence of noticeable time-variation; and the outcome of the three standard diagnostic tests based on the standardized innovations, which help to ascertain the overall validity of the model. The combined information from these three pieces of evidence allows us to draw a robust conclusion on the deterministic or stochastic nature of the yield trend.Footnote 5
Stage 2
We propose a simple approach to verify the extent to which the trend in weather influences the trend in crop yield. Let us assume that the data-generating process for the crop yield yt is as follows.
$$ {y}_t=\sum \limits_{i=1}^k{b}_i{x}_{it}+{u}_t $$
(4)
where \( {u}_t\sim N\left(0,{\sigma}_u^2\right) \) is an error term and x1t, …, xkt are k exogenous variables generated by the following local linear trend models.
$$ \left\{\begin{array}{cc}{x}_{it}={\mu}_{it}+{\varepsilon}_{it}& {\varepsilon}_{it}\sim N\left(0,{\sigma}_{\varepsilon_i}^2\right)\\ {}{\mu}_{it+1}={\mu}_{it}+{v}_{it}+{\eta}_{it}& {\eta}_{it}\sim N\left(0,{\sigma}_{\eta_i}^2\right)\\ {}{v}_{it+1}={v}_{it}+{\xi}_{it}& {\xi}_{it}\sim N\left(0,{\sigma}_{\xi_i}^2\right)\end{array}\right.i=1,\dots, k. $$
(5)
If we use the univariate representation (1) to describe yt, we have that each state in (1) is the linear combination of the corresponding state of xit.
$$ \left\{\begin{array}{c}{\mu}_t=\sum \limits_{i=1}^k{b}_i{\mu}_{it}\\ {}{v}_t=\sum \limits_{i=1}^k{b}_i{v}_{it}\end{array}\right. $$
(6)
and the same thing holds for the error terms
$$ \left\{\begin{array}{c}{\varepsilon}_t={\sum}_{i=1}^k{b}_i{\varepsilon}_{it}+{u}_t\\ {}{\eta}_t={\sum}_{i=1}^k{b}_i{\eta}_{it}\\ {}{\xi}_t={\sum}_{i=1}^k{b}_i{\xi}_{it}\end{array}\right. $$
(7)
If we observe one of the explanatory variables above, say xmt, and we explicitly control for it in the observation equation, the resulting level state captures the trend that is specific to all the other variables xit with i ≠ m
$$ \left\{\begin{array}{c}{y}_t={b}_m{x}_{mt}+{\overline{\mu}}_t+{\overline{\varepsilon}}_t\\ {}{\overline{\mu}}_t={\sum}_{i\epsilon {I}_m}{b}_i{\mu}_{i t}\end{array}\right. $$
(8)
where \( {\overline{\varepsilon}}_t={\sum}_{i\epsilon {I}_m}^k{b}_i{\varepsilon}_{1t}+{u}_t \), and Im = {1, …, k} − m. Thus, one can characterize the contribution of the trend in xmt to the determination of the observed trend in yt by looking at how the estimated trend changes once we control for that variable, that is examining the difference between the level and slope states obtained without and with the inclusion of the variable xmt
$$ \left\{\begin{array}{c}{\mu}_t-{\overline{\mu}}_t={b}_m{\mu}_{mt}\\ {}{v}_t-{\overline{v}}_t={b}_m{v}_{mt}.\end{array}\right. $$
(9)
With regard to our specific empirical application, we consider a model that includes the two trend components as before but also two regressors, the level of temperature and precipitation, constructed as described above. In the case of temperature, we allow for its impact on yield to vary over time in order to capture potential non-linear effects, whereas, for precipitation, we decide for a constant coefficient.Footnote 6
Hence, the model we consider can be written as follows.
$$ \left\{\begin{array}{cc}{y}_t={\overline{\mu}}_t+{\beta}_t{T}_t+\rho {P}_t+{\overline{\varepsilon}}_t& {\varepsilon}_t\sim N\left(0,{\sigma}_{\overline{\varepsilon}}^2\right)\\ {}{\overline{\mu}}_{t+1}={\overline{\mu}}_t+{\overline{v}}_t+{\overline{\eta}}_t& {\eta}_t\sim N\left(0,{\sigma}_{\overline{\eta}}^2\right)\\ {}{\overline{v}}_{t+1}={\overline{v}}_t+{\overline{\xi}}_t& {\xi}_t\sim N\left(0,{\sigma}_{\overline{\xi}}^2\right)\\ {}{\beta}_{t+1}={\beta}_t+{\zeta}_t& {\zeta}_t\sim N\left(0,{\sigma}_{\zeta}^2\right)\end{array}\right. $$
(10)
where βt is the time-varying coefficient on temperature Tt and ρ is the constant coefficient on precipitation Pt.
Once model (10) is estimated, we investigate the statistical characteristics of the resulting yield trend to understand whether there is any change in its stochastic features and in the size of the slope as a result of incorporating weather factors in the model. If any substantial difference emerges compared with the univariate model (1), we attribute it to the influence of weather dynamics, and more specifically to the trend of temperature, given that the latter dominates precipitation as a predictor of crop yield. In particular, we are interested to verify whether any trend stochasticity emerging from the univariate analysis persists in the regression model (10) or whether the trend is effectively deterministic.
Like in Stage 1, we consider four sources of evidence to produce a robust assessment of the statistical nature of the trend, in this case, conditional on the inclusion of our weather variables.
- 1)
We ascertain the gain in fit generated from modelling a stochastic trend by comparing the corresponding value of information criteria with that of a model that has the same specification but features a deterministic trend. Here, we adopt three widely-common information criteria: Akaike (AIC), Schwartz (BIC) and Hannan-Quinn (HQ).
- 2)
We look at the size of the ML estimate of the state error variance and compare it with the one from the univariate model (1) to check whether it has dropped substantively towards zero after the inclusion of the weather regressors.
- 3)
We visually examine the plot of the smoothed state to understand if there are noticeable changes over time, which is a sign of stochastic state.
- 4)
We obtain a final confirmation on the validity of the selected model by running the standard diagnostic tests, built on the standardized innovations of the model. Here, the importance of stochasticity is signalled by a substantial deterioration of any of the three diagnostics when a deterministic trend is imposed.Footnote 7
As the results about the stochasticity of the trend might be dependent on the specific model selected in Stage 1, we also provide a robustness analysis where we use two alternative specifications of a stochastic trend, so that we eventually look at three sets of models:
- 1)
A model with stochastic level and time-varying impact of temperature, i.e. \( {\upsigma}_{\overline{\upeta}}^2\ne 0 \), \( {\upsigma}_{\overline{\upxi}}^2=0 \) and \( {\upsigma}_{\upzeta}^2\ne 0 \);
- 2)
A model with stochastic slope and time-varying impact of temperature, i.e. \( {\upsigma}_{\overline{\upeta}}^2=0 \), \( {\upsigma}_{\overline{\upxi}}^2\ne 0 \) and \( {\upsigma}_{\upzeta}^2\ne 0 \); and
- 3)
A model with stochastic level, stochastic slope and time-varying impact of temperature, i.e. \( {\upsigma}_{\overline{\upeta}}^2\ne 0 \), \( {\upsigma}_{\overline{\upxi}}^2\ne 0 \) and \( {\upsigma}_{\upzeta}^2\ne 0 \).
Finally, we provide a quantitative indicator of the impact of weather on the long-run trend of crop yield by taking the average difference in the smoothed slope obtained from the univariate model (1) and the regression model (10), calculated over the whole period
$$ \frac{1}{T}\sum \limits_{t=1}^T\left({v}_t-{\overline{v}}_t\right) $$
and its relative counterpart
$$ \sum \limits_{t=1}^T\left({v}_t-{\overline{v}}_t\right)/\sum \limits_{t=1}^T{\overline{v}}_t $$
The magnitude of this quantity measures the extent to which the observed long-run annual rate of change in crop yield can be attributed to weather trends, and its sign indicates whether this contribution is positive or negative.