Choosing multiple linear regressions for weather-based crop yield prediction with ABSOLUT v1.2 applied to the districts of Germany

Conradt, Tobias

doi:10.1007/s00484-022-02356-5

Choosing multiple linear regressions for weather-based crop yield prediction with ABSOLUT v1.2 applied to the districts of Germany

Original Paper
Open access
Published: 03 September 2022

Volume 66, pages 2287–2300, (2022)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Biometeorology Aims and scope Submit manuscript

Choosing multiple linear regressions for weather-based crop yield prediction with ABSOLUT v1.2 applied to the districts of Germany

Download PDF

Tobias Conradt ORCID: orcid.org/0000-0001-5341-9794¹

2903 Accesses
5 Citations
Explore all metrics

Abstract

ABSOLUT v1.2 is an adaptive algorithm that uses correlations between time-aggregated weather variables and crop yields for yield prediction. In contrast to conventional regression-based yield prediction methods, a very broad range of possible input features and their combinations are exhaustively tested for maximum explanatory power. Weather variables such as temperature, precipitation, and sunshine duration are aggregated over different seasonal time periods preceding the harvest to 45 potential input features per original variable. In a first step, this large set of features is reduced to those aggregates very probably holding explanatory power for observed yields. The second, computationally demanding step evaluates predictions for all districts with all of their possible combinations. Step three selects those combinations of weather features that showed the highest predictive power across districts. Finally, the district-specific best performing regressions among these are used for actual prediction, and the results are spatially aggregated. To evaluate the new approach, ABSOLUT v1.2 is applied to predict the yields of silage maize, winter wheat, and other major crops in Germany based on two decades of data from about 300 districts. It turned out to be absolutely crucial to not only make out-of-sample predictions (solely based on data excluding the target year to predict) but to also consequently separate training and testing years in the process of feature selection. Otherwise, the prediction accuracy would be over-estimated by far. The question arises whether performances claimed for other statistical modelling examples are often upward-biased through input variable selection disregarding the out-of-sample principle.

LiDAR Data Fusion to Improve Forest Attribute Estimates: A Review

Article Open access 21 June 2024

Enhancing crop recommendation systems with explainable artificial intelligence: a study on agricultural decision-making

Article Open access 11 January 2024

Analysis of factors affecting evapotranspiration zoning

Article 13 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Weather-based crop yield predictions have a long history; correlations between weather variables and agricultural yields had already been studied in the first quarter of the twentieth century (Meinardus 1901; Hooker 1907; Fisher 1924), and estimating regional yields by multiple linear regressions from time-aggregated weather data has been applied for decades. The full spectrum of potentially yield-relevant meteorological averages in varying seasonal time windows is however rarely scrutinized by the existing models; the same holds for landscape-specific weather response patterns of different crops. The algorithm presented in this article attempts to minimize these gaps by extensive regression testing for automatically selecting the most informative weather aggregate combinations on a per-district basis (however guided by multi-district performance), finally aiming for reliable extrapolations in climate change scenario assessments.

The challenge of weather-based yield regressions is hardly in validating the general approach; it usually works and explains a significant part of the observed yield variations. Numerous studies have just demonstrated that for different crops in different locations around the world (e.g. Ceglar et al. 2016; Nemoto et al. 2016; Schauberger et al. 2017b). It is the details of the implementation which matter and finally decide to what extent the unavoidable prediction errors can be reduced. Even resorting to simple linear regression models leaves the modeller with further decisions galore, frequently too easily taken based on personal beliefs but in fact defining the true challenge of research and development with this class of models.

Optimizing the selection of predictor variables defines the focus of this contribution. The “oldschool approach” still taken by many researchers is largely an expert choice. It might have been guided by selective correlation analyses for preselected candidate variables (González-Fernández et al. 2020; Ji et al. 2019), stepwise regression (Kern et al. 2018; Salehnia et al. 2020), or consideration of crop growth stages for suitable time windows (Butts-Wilmsmeyer et al. 2019; Zhang et al. 2017). In some cases, the selection effort is critically flattened, even if continental climate impact assessments are at stake: (Moore and Lobell 2014, 2015) used temperature and precipitation averages of the growing season as sole meteorological basis for that purpose. The challenge of choosing suitable input variables where optional input data are abundant however paves the way to machine learning. As a non-meteorological example, Gómez et al. (2019) had potato yields automatically fitted to 54 spectral bands and indices of Sentinel 2 satellite images by diverse methods (including generalized linear model, quantile regression, support vector machines, and neural networks). While a massive, automated search for the best predictor features and methodology evades modeller subjectivity, explanations for a certain system behaviour can hardly be given, and an eventual gain in predictive performance remains questionable if there is no strict separation between training and testing data (“out-of-sample”).

By no means, the method presented here can be claimed to be the last word on the subject; however, it systematically optimizes the often disregarded input variable selection process while maintaining the principal approach of multivariate linear regressions. Therefore the code shall be named “Assessing Best-predictive Sets fOr multiple Linear regressions throUgh exhaustive Testing”, in short: ABSOLUT. Version 1.0 lacked the consequent out-of-sample processing in the feature and regression selection parts albeit it was observed for regression testing. This was corrected in v1.1, and the current version 1.2 presented here also excludes overlaps of weather features’ time aggregations in the regression formulas.

The ABSOLUT approach can be counted as a kind of brute force machine learning uncommon to regression-based crop yield prediction: Among the 362 crop forecasting studies published in the years 2004–2019 that were evaluated by Schauberger et al. (2020), there were 258 utilizing regression, and only a few dozen implementing established machine learning approaches most of which were automated neural networks (28 cases) followed by random forests (12). This study should therefore serve as proof of concept for bridging the gap between regression and machine learning approaches which may also be transferable to similar setups, e.g. with panel or nonlinear regression models. The working hypothesis is that regression-based modelling can still compete with machine learning approaches if automated optimizations are applied.

Materials

Hard- and software

As ABSOLUT builds on exhaustive searches for optimal feature combinations in linear regressions, parallel execution of the code on a cluster computer using several dozen cores is advisable. A state-of-the-art single PC or notebook would probably need 1–2 weeks for the Germany example given all CPU cores (usually four) are engaged.

The code is written in R (https://www.r-project.org/) and requires version 3.5.1 (used in this study) or newer. It makes use of the extension packages leaps (Lumley 2017; v.3.0) and doMPI (Weston 2017; v.0.2.2). Suggestions for pre- and postprocessing software can be found in a separate “Directions for use” document placed in the code repository (Conradt, 2021b).

Input data

Any ABSOLUT application needs a domain divided into spatial subunits, henceforth called districts, for which there are individual crop yield time series and monthly weather data available. A large number of districts (ideally more than 100) and many years with yield data (preferably more than 20) are required for the selection of valid regression feature combinations.

Monthly weather data are needed for each district and should spatially correspond to the agricultural areas within them. The weather data should start at least 1 year before the yield records start and end not earlier than with the growing season of the final year covered by the yield data; otherwise, not all yield information can be considered. In detail, the modeller fixes the last calendar month whose weather data are to be considered for the growing seasons, and the model will evaluate weather–yield correlations in the 12-month periods ending with this month. For example, if there are yield data for the years 1996–2015 and this “cut-off month” is set to May, the weather data must cover the period from June 1995 to May 2015 to utilize the complete yield information.

As it is not necessary to include the very end of the growing season (the actual harvest dates shift between years anyway), a timely cut-off and weather data reaching far enough into the current year can be used for pre-harvest yield forecasts (Schauberger et al. 2017b, regarding the effects of shortened weather input). All data are to be provided in form of ASCII tables; see the example files in the data repository (Conradt, 2021a).

Specifics of the example application

Germany is a challenging test bed for agricultural modelling due to the heterogeneity of its landscapes and cropping conditions; for a brief geographical description, see section S1 in the Supplement. The spatial basis of the example is Germany’s 401 district-level subdivisions (including the city states of Hamburg and Berlin as single units) as they existed on 1st January 2018. The time frame was principally determined by the district-wise crop data covering the years 1999–2020; yield predictions for 2021 were however possible through more recent weather data. Observed yields for 2021 and further weather data allowing first 2022 predictions became available shortly before publication of this article and were considered in Table 2.

Primary data from external sources

The district geometries were taken from the official 1: 1 million digital map of administrative areas, status 1st January 2018, published by the German Federal Agency for Cartography and Geodesy (BKG 2018). Crop yield data are provided by the Statistical Offices of Germany (Statistische 2021b; DESTATIS, 1982ff). National, state, and district level data were obtained for ten crop species in the harvest years 1999–2020. Additionally, district-wise crop growing areas (Statistische 2021a) were obtained for the year 2016 and applied as weighting factors in aggregating the estimated district yields to state and national averages. Monthly weather data were obtained from the Climate Data Centre of the German Weather Service (DWD). Point of departure was their monthly 1-km grids of meteorological variables for the years 1998–2021 (DWD 2021).

The spatial distribution of agricultural areas within the districts has to be taken into account for determining the locally relevant weather conditions, especially for districts including both an agricultural lowland and a mountainous part (Conradt et al. 2016). The 2012 Corine Land Cover data (Copernicus 2020) were used for this purpose. Details of the necessary preprocessing are given in S2.2. Respectively prepared input data as used for the example application have been published (Conradt, 2021a).

Methods

At its core, the ABSOLUT algorithm applies multiple linear regressions of the form

$$y\left(t\right) = \alpha + {\beta }_{0}t +\sum\limits_{i=1}^{d\in \left\{0,\dots ,4\right\}}{\beta }_{i}{w}_{i,t} + \varepsilon$$

(1)

for each spatial subunit (“district”) of the target domain. Herein, y(t) is the yield, i.e. the harvested mass per area, in dt ha⁻¹ of a certain crop in the year t; α is the intercept; and the β are the other regression parameters: There is a linear basis trend over time β₀t, and there can be up to four additional terms for aggregated weather variables w_i,t. Such an aggregated weather variable could, for instance, be the precipitation sum of December, January, and February preceding the harvest in the summer of t. These aggregates are henceforth called weather features to avoid confusion with weather variables like temperature or precipitation in general. The closing ε is the estimation error to minimize.

The decision to limit the number of weather features d to a maximum of four was guided by practical experience with statistical yield model performance. Including more features would not necessarily increase prediction accuracies but come with an even more extensive computational demand for testing myriads of possible feature combinations. Using less features however provides more reliable regression parameter estimations and can be a better choice for certain years and districts. The automatic consideration of different numbers of weather features in Eq. 1 was introduced with v1.2; former versions were hard-wired to d = 4.

The R code of ABSOLUT is freely available (Conradt, 2021b) and consists of five programs that have to be run subsequently. The first three of them determine the weather features to be used in the finally selected district models. The sectional workflow, delineated in Fig. 1, is partly owing to different stages of code development and was kept to allow for checks into the intermediate output/input files. The following subsections briefly describe the purposes and main features of the programs; further details are given in S3.

The five steps of ABSOLUT

Program 1: “the prospector”

This initial step is principally exhaustive input feature testing for obtaining best-fitting multiple linear regressions. Although the results cannot be used for predictive purposes directly, it contributes to narrowing the search window for regressions of higher predictive capability. Time aggregates of the chosen weather variables are calculated for periods of 2 to 6 months taken from the 12 months before harvest; these are called weather features. Using all possible start months, this means 11 different 2-month features per variable, ten 3-month features, nine 4-month features and so on, in total 45 features per weather variable, each with one value per year.

The better purpose of program 1 is to subset the pool of possible regression features to those very probably containing predictive power, and this is based on counting the number of occurrences of weather features among the best-fitting regressions. Accordingly, all weather features used in these regressions are sorted along their frequencies of use, and a relevance cutoff is determined based on binomial probabilities—features need to have been selected more often then they would have occurred by pure chance with 99.9% probability; the number of features above this threshold is henceforth denoted q. Details of the calculation are explained with the case study example below.

Note that all calculations made in this and the following programs are done separately for each target year, so that any feature selection and prediction is made solely based on information from other years plus the meteorological data of the target year. Only for current and future year projections (scenarios), all available data from the past are utilized.

Program 2: “the workhorse”

Program 2 effects the highest computational burden, using parallelisation is strongly recommended. Again, possible input feature combinations are investigated for their predictive power in multiple linear regressions according to Eq. 1, but the weather features are now chosen from the target year-specific preselections of significant features provided by program 1. Consequently, only predictions for these years whose yield data are censored from the regression equations (leave-one-out) are calculated. What is demanded here is an evaluation of the predictive skill of the input feature combinations, not just single features. This is done by Pearson correlations (r values) between reported yields and the out-of-sample yield predictions from the regressions.

Program 3: “the gold pan”

The logically following task addressed by this program is determining the optimal regression model for each spatial subunit. Simply choosing the feature combinations separately for each district from top of their local r ranking (“local heroes”) could however be misleading because a single yield time series estimation provides rather insufficient validation; any chosen combination should be cross-validated by above-average performances in many districts.

The solution currently implemented (“global and local heroes”) merges globally best performing combinations (highest average r values) with those working exceptionally well in smaller subsets, down to 10% of the districts. The idea behind is to account for special conditions in certain landscapes. The locally best-performing regression out of this third selection is finally implemented for each district.

Programs 4 and 5: “crucible and mould”

What remains is using the selected regression equations for yield prediction in the respective target years; this is done by program 4. Program 5 aggregates the district yield predictions by weighted averaging to predictions for the full modelling domain. Spatial aggregation provides higher prediction accuracy due to mutual error compensation among noisy district results.

Setup of the Germany application

Three principal weather variables were selected: average temperature, precipitation, and sunshine duration. Temperature governs the physiological processes in plant growth and fertility. Sunshine is used as proxy for radiation, the energy source for photosynthesis. Precipitation and radiation (the main driver for evapotranspiration) are finally decisive for water stress. There are actual radiation data available, but sunshine duration is measured at more locations and regularly delivered better results. Applying ABSOLUT to other world regions and crop species may require other meteorological variables for optimal results, and data availability has always to be considered. Even non-meteorological variables would be acceptable, but here, we focus on weather effects. The minimum of yield data per district had been set to 17 because the example dataset was limited to 22 years (1999–2020), and a higher requirement would have excluded many districts with incomplete observations.

The primary test crop was winter wheat, and the last month for weather input before each year’s harvest (typically in July or August) was set to June. For the secondary test crop silage maize, the weather input season was set to end by August; maize harvest may occur late in the year but growth stagnates in autumn.

Results

Observations along the workflow for winter wheat

Running program 1

With the three weather variables 3 · 45 = 135 weather features were generated. This meant $\left(\begin{array}{c}135\\ 4\end{array}\right) = 13\;232\;835$ different regressions per district and target year to be tested which required a couple minutes using 24 CPUs in parallel.

There is a stark difference between the goodness-of-fit of the top-ranked regressions applied to the same data that was used for selecting them and their performance in validation mode, i.e. with the observed yield value of any single year to predict censored from the input (out-of-sample validation across all target years). The respective r² values for winter wheat averaged over all district models are 0.878 and 0.115, Fig. S2 shows maps of the the spatial distributions. It is clear that the regressions selected in this step cannot be immediately used for predictions.

Which weather features do however appear significantly often in these regressions, each target-year collection including the top 23 per district? The algorithm requires that the number of occurrences exceed a frequency expected by pure chance with 99.9% confidence. If there were pure noise in the data, each of the 135 features originally provided would turn up with a constant probability of $p = \frac{4}{135}$ per regression sample. With a finite number of samples their frequencies follow a binomial distribution. In the winter wheat case, there were n = 7498 samples (326 districts times 23) per target year; thus, the expectation value for any weather feature in the noninformative case would be E(x) = np = 222.163 occurrences with an expected standard deviation of $E\left(\sigma \right) = \sqrt{np\left(1-p\right)} = 14.683$. The number of occurrences not to be exceeded in 99.9% of the cases would be P₉₉₉(n,p) = 269. Depending on the actual frequencies in the separate target-year outputs, between 29 and 36 weather features were selected in the winter wheat case; Fig. 2 shows the selections and frequencies for four target years.

The average temperature towards the end of the growing season (temperature aggregates for May and June and March to June) stick out clearly for each target year; this is in full agreement to the often described temperature sensitivity of wheat during and after anthesis (Akter and Islam 2017; Farooq et al. 2011; Schauberger et al. 2017a). Consequently, the majority of the aggregates shown in Fig. 2 are temperature averages (magenta), while less than a quarter are precipitation depths (cyan) and only 18 out of 134 are sunshine durations (orange). The frequency of selections depends strongly on which year was omitted from the input data, but there are also quite stable and interesting patterns: For example, sunshine duration in July and August (in the pre-harvest years, before sowing!) is selected for every year shown in Fig. 2 and in 21 of the 23 target years considered in total.

Running programs 2 and 3

The 23 target-year specific output tables of program 2 have between A₂₀₀₄ = 4050 and A₂₀₀₀ = 16 216 lines (below the header) for all possible input variable combinations and 326 columns representing the districts for which enough yield data were available. Negative correlation coefficients could be found in 28.3% of the table cells. The negative extremes are near-perfect anticorrelations; eight target years had r_min < − 0.97.

Given the number of 22 out-of-sample regression estimates behind every correlation coefficient, the computational demand is significant. Running program 2 on 112 CPU cores took about 23 h for winter wheat. Calculations for other crops led to fewer pre-selected input weather features and could be done in a few hours, though. It should also be noted that the major numerical effort is done with program 2, the remaining code parts only take a couple minutes to complete on a single CPU.

Even for the generally best performing feature combinations, the individual per-district correlations between out-of-sample predictions and observed yields are rather noisy. Consequently, 258–290 different combinations make up the individually best-performing regressions for the 326 districts considered in the winter wheat case, and their average correlations of 0.717–0.766 still deteriorate when used for predictions based on new data.

The input feature combinations leading to the highest out-of-sample average correlations (averaged over all spatial subunits) are determined by program 3. The top-ranked combination for each target year is listed in Table 1: Note that tas0506 is frequently included, while tas0306 does not occur at all here despite the fundamental positioning of both features in Fig. 2.

Table 1 Target year-specific combinations of input weather aggregates performing best across all districts and the average Pearson correlation of their out-of-sample predictions in the winter wheat example for Germany; output of program 3. The input feature tas0506 is bolded to highlight its many occurrences

Full size table

The “global and local heroes” selection was finally applied to determine the target-year specific sets of predictors; see S4 for the analysis of the alternative selection methods. For most years, 12–16 combinations were actually applied, the maximum was c_2004,2005 = 19, and the minimum c₁₉₉₉ = 9. These combinations contained 12–17 different weather features, and the correlation averages of the so determined district models were in the range of 0.572–0.650; these numbers are a more realistic indication of the expectable prediction performance.

Running programs 4 and 5

Program 4—utilizing the out-of-sample-determined district-specific regressions for the out-of-sample yield predictions—loops through all districts within a minute. Their spatial time-series aggregation towards Germany’s national winter wheat yields, computed by program 5, is shown in Fig. 3 together with the results for silage maize. In contrast to the intermediate aggregation for the federal state of Saxony (Fig. S3), the national estimates for winter wheat yields expose higher noise but reduced errors (R²_val = 0.417, RMSE = 4.58 dt ha⁻¹), while the spatially aggregated silage maize modelling shows remarkable accuracy (R²_val = 0.837, RMSE = 13.9 dt ha⁻¹).

Prediction performance

Regional performance for 2018 silage maize yields

Using district regressions provides not only a basis for aggregate results but can also help identify spatial patterns despite the higher noise in single district results. In contrast to major agricultural zones of the world like the North China Plain or the US corn belt, Germany is characterized by a high diversity of soil landscapes forming a distinct pattern of high and low yield regions (Hennings 2013; Kruse 2016); hence, the ability to predict spatial yield patterns should be determined from relative changes as shown in the upper map panels of Fig. 4.

The silage maize harvest forecast for the drought year 2018 was chosen as an example due to extreme yield losses concentrating in Eastern Germany. The S5 section in the Supplement presents respective maps for the 2019 yield changes of winter wheat, sample statistics of both cases, and observations from the error compensation for spatial aggregates.

Despite the noise, the maps of the predicted and observed relative yield changes in districts shown in Fig. 4 (a and b) show a general similarity of spatial patterns. There are regional misses especially in north-western parts of the country and along the southern border, but the centre of gravity of the strongest yield losses could correctly be located. Panel (c) gives an impression of the regional distribution of prediction power through absolute RMSE values calculated from the complete record of 1999–2020 out-of-sample prediction errors. In general, the method works fine in Northern Germany and some parts of the south, but has some issues in western to central areas. The actual 2018 forecast errors (d) were much larger in many districts including those with comparably small RMSEs. It may be assumed that the training period (1999–2017) did not contain enough reference drought years; in fact, only 2003 might have been comparable to some extent.

Yield predictions for Germany

There are at least two institutions regularly publishing crop yield forecasts or estimates: the German Federal Statistical Office (Statistisches Bundesamt, DESTATIS) and the Joint Research Centre (JRC) of the European Commission with their MARS (Monitoring Agricultural ResourceS) activity. The DESTATIS reports (DESTATIS, 1982ff) with national and federal states’ estimates are based on extensive field monitoring, on-site observations during growth and harvest by farmers and travelling experts. The MARS forecasts utilize a number of sources and predictors but seem to largely rely on remote sensing. National aggregate predictions are released via monthly bulletins (MARS 1993ff).

Yield predictions for crops harvested in June or July (cereals or rape) can usually be computed in the beginning of July as soon as the monthly weather data for June are available. For silage maize or sugar beets, the August weather data should be completed. Both conditions were constantly observed for the examples presented. Hence, the preliminary national crop yield estimations of DESTATIS, regularly published in the beginning of August and at the end of September, are used for comparison. Only the winter barley estimates are released in between (which explains their small deviations from the final figures), and DESTATIS does not provide early indications at all for sugar beet yields. For MARS, the annual issues 7 (typically released at the end of July) and 9 (typically released in mid-September) are the seasonal counterparts to compare with. Table 2 compares the official national yields with the different predictions for five crops in the years 2018–2021 and also shows some forecasts for 2022.

Table 2 National average yields for various crops: comparison of official statistics (yield) to near-harvest predictions of ABSOLUT, the German Federal Statistical Office (DESTATIS, 1982ff), and the European Commission’s Joint Research Centre (MARS, 1993ff)

Full size table

Weather input of Gornott and Wechsung

Experiments with district-based crop yield prediction in Germany through multiple regression using weather aggregates had already been presented by Gornott and Wechsung (2016). In contrast to the algorithm presented here, only year-on-year (YoY) changes were considered saving the explicit estimation of an underlying linear trend. The study investigated different options to couple the coefficients of the district models (panelling), an approach further pursued with cluster analysis (Conradt et al. 2016). The nearest equivalent to ABSOLUT are therefore the independent district regressions of Gornott and Wechsung (2016), called there “separate time series models” (STSMs), and the most fundamental difference is that all STSMs used the same set of input variables—predefined per crop—while ABSOLUT searches for some optimal combinations.

How powerful are the predefined input variables in terms of prediction accuracy compared to the combinations drawn by ABSOLUT? To answer this question, the district regressions were charged with the weather variables originally used by Gornott and Wechsung (2016); implementation details are given in S7. The time series for national yield predictions calculated from the Gornott and Wechsung weather aggregates are shown in Fig. 5. Compared to the result of the ABSOLUT algorithm in Fig. 3, the lower accuracy is evident; the shares of explained interannual winter wheat yield variability dropped from 41.7% to mere 18.6%. Note that the wheat predictions resemble the observed ups and downs predominantly in the first half of the time; from about 2010 onwards, there is hardly any correlation any more. The coefficient of determination for the national silage maize yield predictions reaches at least 42.7%, while Gornott and Wechsung (2016) reported 50% for their prediction of interannual changes. This is however clearly below the 83.7% obtained with ABSOLUT. Maps showing the spatial goodness-of-fit distributions can be found in the Supplement (Figs S5 and S6).

Discussion

Performance in comparison to previous studies and official yield predictions

The first lesson learned was that the excessive testing and optimization of regressor combinations consumes degrees of freedom, thus predictive power, just like the estimation of many coefficients within the multiple regressions. The solution was to require significant above-average performances in many districts for qualifying combinations of weather aggregates (input features) as predictors: Their performance exceeded the results obtained with pre-defined weather features used in precursor studies (Gornott and Wechsung 2016; Conradt et al. 2016).

Considering the fact that the ABSOLUT results can be obtained 2–4 weeks in advance to those of the other sources, the quality of its predictions of national yield averages is in the same league with DESTATIS and MARS. The ABSOLUT regressions produced tendentially more overpredictions, probably due to uncaptured drought effects (especially soil drought). The explained shares of yield variations are in accordance with the literature: Global studies assessing the relative impact of weather factors on crop yield variations (Frieler et al. 2017; Schauberger et al. 2017b) give typical ranges of 50–60% for wheat and maize yields of main producer countries; for wheat in the USA, only 30–40% were reported as well. A careful assessment separating the impacts of farm management and weather effects on wheat yield variations across Germany (Albers et al. 2017) found average shares of 43% of the variations caused by the weather and 49% owing to management. While non-meteorological factors like irrigation status, fertilizer price, and general farming conditions are much more decisive in developing countries (Assefa et al. 2020), national aggregate yields of staple crops in Europe may depend even more on weather than previously assumed (Agnolucci and De Lipsis 2020). To tap the full potential of weather-based yield modelling, meteorological extremes (heat waves, storm precipitation) need however also to be considered; by using only time aggregations over several months, this is not possible.

Regarding possible COVID-19 effects in the official 2020 and 2021 yield data, farmers’ cropping operations had been done as usual in Europe (no COVID restrictions for single-driver machines). Only pandemic-induced micrometeorological effects not reflected in the weather data like sunshine intensity and alterations in air chemistry (less NO₂, more O₃) caused by reduced air pollution (Skirienė and StasiškienėŽ 2021, Torkmahalleh et al. 2021; Silva et al. 2022) may have affected the observed yields. However, no literature specifically devoted to lockdown effects on plant growth could be found, so these effects are probably hardly traceable.

An over-confidence trap in statistical modelling

The most important improvement of ABSOLUT v1.2 over its predecessor v1.0 is that not only the parameter estimations used for prediction are solely based on weather–yield relations in other years than the the target year (out-of-sample) but also the search for the weather features to be applied. Originally, the feature combinations for the district regressions were fixed once and for all based on the full dataset. The biased R²_val indications of v1.0 reached more than 0.8 for the national winter wheat yield time series, and silage maize results were even breaking 0.9.

The performance indications have now been corrected by the consequent separation of training and testing data for all aspects of feature selection and parameter estimation. Forecast and scenario outputs were hardly affected by the correction—all program versions use all available data from the past for predictions beyond the coverage of the observed yield time series. How much the performance measures had been upward biased revealed the enormous information content hidden in the selection of input feature combinations.

This is important for any related kind of statistical modelling: Especially if the selection of input variables is less freely adaptive but guided by expert knowledge, it may happen quite often that the resulting model performs seemingly well, but only in the environment in which it was developed. With the recent input data, the historical prediction performances of Gornott and Wechsung (2016) (cf. also Conradt et al. 2016) could be reconstructed to some extent for silage maize but hardly at all for winter wheat. An interpretation is that the formerly observed correlations between uniformly defined weather variables and wheat yield deteriorate under climate change and influential weather variables become gradually replaced by others (to which the ABSOLUT algorithm will automatically adopt). Given the rather abrupt loss of correlation in Fig. 5 after 2010, the final year considered by Gornott and Wechsung (2016), it can as well be assumed that their input variable selection was (unconsciously?) guided by its performance in the historic environment, and any data from outside the original time window would spoil the original correlation even without climate change.

The question remains, to what extent information absorbed in the model setup process and henceforth contained in model structures makes overconfidence in predictions from new input data a common issue, not only in weather-based crop yield modelling. In recent years, machine learning algorithms gained popularity in crop modelling with seemingly better results than multiple regression modelling (Cai et al. 2019; Cao et al. 2021; Leng and Hall 2020; Zhang et al. 2020; Bouras et al. 2021). However, practically all of these studies have in common that an initial selection of predictor variables was made using all available data (and often simple tools like Pearson correlations) before the advanced methods were applied.

Improvement potentials and development opportunities

There are two critical spots of the present version of the algorithm which are at the same time opportunities for improvements with future revisions: The first one is the assumption of a linear base trend of yields independently estimated for each spatial subunit. This allows straightforward prediction of absolute yields instead of relative changes (a major difference to Gornott and Wechsung 2016, and Conradt et al. 2016), but might be oversimplistic: After several decades of technological progress with ever increasing yields, there are stagnations reported for different crops and world regions (Chen 2018; Schauberger et al. 2018; Mehrabi and Ramankutty 2019). Already existing methods like the stochastic trend separation by Agnolucci and De Lipsis 2020) highlight the potential for improvement.

The second area of concern is the final selection of independent regressor variables, i.e. weather aggregates, for each spatial subunit. The general challenge is about finding the optimum balance between a high number of multi-site confirmations of predictive power and the flexibility needed to adopt to smaller regions demanding alternative combinations for more exact predictions. Perhaps spatial clusters of predictors should be explicitly considered similarly to what has been done for parameter values (Cai et al. 2014; Conradt et al. 2016). Finally, there is also no stability of the weather–yield relations over time, probably caused by nonstationarity of meteorological variables in the context of climate change: Correlation shifts have been observed by Trnka et al. (2016) or Ceglar et al. (2020) which calls for additional flexibility.

This flexibility could also be connected to shifts in the phenological calendar, a well-known effect of climate change (Racca et al. 2015; Zhang et al. 2022). Shifting growth stages could probably be considered by shifting time windows for weather feature aggregation which in turn would require a finer time resolution of the weather input data, e.g. decadal instead of monthly data.

Conclusions

It could be demonstrated that the ABSOLUT algorithm, already in its present stage of development, is capable of explaining significant shares of the national yield variations of major crops in Germany solely based on weather variables. Given the near real-time availability of German weather data, early in-season yield predictions are possible with accuracies comparable to official national and EU forecasts.

Probably the most important finding was the “overconfidence trap” for any kind of regression modelling with expert-guided regressor selection: As the choice of regressors contains a similar amount of information as the parameter values do, it is very easy to unvoluntarily violate the principle of independent model training and testing. Many performance figures given in the literature for statistical yield models may be positively biased for that reason. The algorithm presented here tries to escape this trap by objectivizing the regressor selection; however, some basic choices for relevant meteorological variables still remain with the modeller.

Primarily developed for demonstrating the feasibility and principal advantage of semi-automatic regression feature selection, ABSOLUT offers many potentials for improvements. Among these is the capability to capture nonlinear long-term yield trends or a better way to balance temporal and spatial correlations in the input data. A weak spot shared with other regression models is the time aggregations blinding the model for exceptionality and effects of (short-time) weather extremes which become more frequent under climate change. Consequently, related questions about the impact of climate change on food security underline the need for further research into this field.

Data availability

The input data needed for the example application to Germany consist of crop yields and cropping areas in administrative regions, district-level monthly weather data, and a control file. They are publicly available at https://doi.org/10.5281/zenodo.5625774 (Conradt, 2021a).

Code availability

The model code, consisting of five R programs, is publicly available at https://doi.org/10.5281/zenodo.5789350 (Conradt, 2021b).

References

Agnolucci P, De Lipsis V (2020) Long-run trend in agricultural yield and climatic factors in Europe. Clim Change 159(3):385–405. https://doi.org/10.1007/s10584-019-02622-3
Article Google Scholar
Akter N, Islam R (2017) Heat stress effects and management in wheat. A Rev Agron Sustain Dev 37(5):37. https://doi.org/10.1007/s13593-017-0443-9
Article CAS Google Scholar
Albers H, Gornott C, Hüttel S (2017) How do inputs and weather drive wheat yield volatility? The example of Germany. Food Policy 70:50–61. https://doi.org/10.1016/j.foodpol.2017.05.001
Article Google Scholar
Assefa BT, Chamberlin J, Reidsma P et al (2020) Unravelling the variability and causes of smallholder maize yield gaps in Ethiopia. Food Secur 12(1):83–103. https://doi.org/10.1007/s12571-019-00981-4
Article Google Scholar
BKG (2018) Verwaltungsgebiete [administrative areas] 1 : 1 000 000 as of 1 January 2018. Vector geodata (shapefiles) incl. documentation. GeoBasis-DE / Bundesamt für Kartographie und Geodäsie [Federal Agency for Cartography and Geodesy], Leipzig, https://daten.gdz.bkg.bund.de/produkte/vg/vg1000ebenen0101/2018/vg100001-01.lamgw.shape.ebenen.zip, last access January 2022
Bouras EH, Jarlan L, Er-Raki S et al (2021) Cereal yield forecasting with satellite drought-based indices, weather data and regional climate indices using machine learning in Morocco. Remote Sens 13(16):3101. https://doi.org/10.3390/rs13163101
Article Google Scholar
Butts-Wilmsmeyer CJ, Seebauer JR, Singleton L et al (2019) Weather during key growth stages explains grain quality and yield of maize. Agronomy 9(1):16. https://doi.org/10.3390/agronomy9010016
Article Google Scholar
Cai Y, Guan K, Lobell D et al (2019) Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches. Agric for Meteorol 274:144–159. https://doi.org/10.1016/j.agrformet.2019.03.010
Article Google Scholar
Cai R, Yu D, Oppenheimer M (2014) Estimating the spatially varying responses of corn yields to weather variations using geographically weighted panel regression. J Agric Resour Econ 39(2):230–252. https://ageconsearch.umn.edu/record/186586/files/JAREAug20146Caipp230-252.pdf
Cao J, Zhang Z, Tao F et al (2021) Integrating multi-source data for rice yield prediction across China using machine learning and deep learning approaches. Agric for Meteorol 297(108):275. https://doi.org/10.1016/j.agrformet.2020.108275
Article Google Scholar
Ceglar A, Toreti A, Lecerf R et al (2016) Impact of meteorological drivers on regional inter-annual crop yield variability in France. Agric for Meteorol 216:58–67. https://doi.org/10.1016/j.agrformet.2015.10.004
Article Google Scholar
Ceglar A, Zampieri M, Gonzalez-Reviriego N et al (2020) Time-varying impact of climate on maize and wheat yields in France since 1900. Envir Res Lett 15(9):094039. https://doi.org/10.1088/1748-9326/aba1be
Article Google Scholar
Chen H (2018) The spatial patterns in long-term temporal trends of three major crops’ yields in Japan. Plant Prod Sci 21(3):177–185. https://doi.org/10.1080/1343943X.2018.1459752
Article Google Scholar
Conradt T (2021a) ABSOLUT input data for an example application on the districts of Germany (v1.1). Zenodo, 10.5281/zenodo.5625774
Conradt T (2021b) ABSOLUT R programs (v1.2). Zenodo, 10.5281/zenodo.5789350
Conradt T, Gornott C, Wechsung F (2016) Extending and improving regionalized winter wheat and silage maize yield regression models for Germany: enhancing the predictive skill by panel definition through cluster analysis. Agric for Meteorol 216:68–81. https://doi.org/10.1016/j.agrformet.2015.10.003
Article Google Scholar
Copernicus (2020) Corine Land Cover (CLC) 2012, version 2020_20u1. GeoTIFF raster with 100 m resolution. European Union, Copernicus Land Monitoring Service, European Environmant Agency (EEA), Brussels and Copenhagen, https://land.copernicus.eu/pan-european/corine-land-cover/clc-2012?tab=download, last access January 2022
DESTATIS (1982ff) Wachstum und Ernte – Feldfrüchte. Fachserie [Thematic report series] 3, 3.2.1, annual volumes (esp. nos 16), Statistisches Bundesamt, Wiesbaden, https://www.statistischebibliothek.de/mir/receive/DESeriemods00000335, last access August 2022
DWD (2021) Grids of monthly averaged daily air temprature (2m), monthly total precipitation, and monthly total sunshine duration over Germany, version v1.0. ASCII grids, spatial resolution 1 km. DWD Climate Data Center (CDC), Deutscher Wetterdienst (DWD), Offenbach, https://opendata.dwd.de/climateenvironment/CDC/gridsgermany/monthly/, last access January 2022
Farooq M, Bramley H, Palta JA et al (2011) Heat stress in wheat during reproductive and grain-filling phases. Crit Rev Plant Sci 30(6):491–507. https://doi.org/10.1080/07352689.2011.615687
Article Google Scholar
Fisher RA (1924) The influence of rainfall on the yield of wheat at Rothamsted. Philos Trans R Soc B 213(404):89–142. https://doi.org/10.1098/rstb.1925.0003
Article Google Scholar
Frieler K, Schauberger B, Arneth A et al (2017) Understanding the weather signal in national crop-yield variability. Earth’s Future 5(6):605–616. https://doi.org/10.1002/2016EF000525
Article Google Scholar
Gómez D, Salvador P, Sanz J et al (2019) Potato yield prediction using machine learning techniques and Sentinel 2 data. Remote Sens 11(15):1745. https://doi.org/10.3390/rs11151745
Article Google Scholar
González-Fernández E, Piña-Rey A, Fernández-González M et al (2020) Prediction of grapevine yield based on reproductive variables and the influence of meteorological conditions. Agronomy 10(5):714. https://doi.org/10.3390/agronomy10050714
Article Google Scholar
Gornott C, Wechsung F (2016) Statistical regression models for assessing climate impacts on crop yields: a validation study for winter wheat and silage maize in Germany. Agric for Meteorol 217:89–100. https://doi.org/10.1016/j.agrformet.2015.10.005
Article Google Scholar
Hennings V (2013) Ackerbauliches Ertragspotential der Böden in Deutschland 1 : 1 000 000 (SQR1000). 1 : 1 million map, Bundesanstalt für Geowissenschaften und Rohstoffe, Hannover, Germany, https://www.bgr.bund.de/DE/Themen/Boden/Ressourcenbewertung/Ertragspotential/Ertragspotentialnode.html, last accessed January 2022
Hooker RH (1907) Correlation of the weather and crops. J R Stat Soc 70(1):1–51. https://doi.org/10.2307/2339501
Article Google Scholar
Ji Y, Zhou G, Wang L et al (2019) Identifying climate risk causing maize (Zea mays L.) yield fluctuation by time-series data. Nat Hazards 96(3):1213–1222. https://doi.org/10.1007/s11069-019-03605-4
Article Google Scholar
Kern A, Barcza Z, Marjanović H et al (2018) Statistical modelling of crop yield in Central Europe using climate data and remote sensing vegetation indices. Agric for Meteorol 260–261:300–320. https://doi.org/10.1016/j.agrformet.2018.06.009
Article Google Scholar
Kruse K (ed) (2016) Bodenatlas Deutschland. Bundesanstalt für Geowissenschaften und Rohstoffe, Schweizerbart, Stuttgart, Germany
Leng G, Hall JW (2020) Predicting spatial and temporal variability in crop yields: an inter-comparison of machine learning, regression and process-based models. Environ Res Lett 15(044):027. https://doi.org/10.1088/1748-9326/ab7b24
Article Google Scholar
Lumley T (2017) Leaps: Regression subset selection. R package version 3.0. Based on Fortran code by Alan Miller. https://CRAN.R-project.org/package=leaps – Canonical link to current version, v.3.0 accessible through archive link.
MARS (1993ff) Crop monitoring in Europe. JRC MARS Bulletin annual volumes (nos 7, 9), Joint Research Centre of the European Commission, Ispra, https://ec.europa.eu/jrc/en/mars/bulletins, last access August 2022
Mehrabi Z, Ramankutty N (2019) Synchronized failure of global crop production. Nat Ecol Evol 3(5):780–786. https://doi.org/10.1038/s41559-019-0862-x
Article Google Scholar
Meinardus W (1901) Einige Beziehungen zwischen der Witterung und den Ernteerträgen in Nord-Deutschland. In: Verhandlungen des Siebenten Internationalen Geographen-Kongresses, Berlin, 1899. Sampson Low & Co., W. H. Kühl, and H. Le Sondier, London, Berlin, and Paris, pp II, 421–428, https://archive.org/details/verhandlungende19unkngoog/page/n457/mode/1up, last accessed in January 2022, scan lacks reproduction of tables.
Moore FC, Lobell DB (2014) Adaptation potential of European agriculture in response to climate change. Nat Clim Change 4(7):610–614. https://doi.org/10.1038/nclimate2228
Article Google Scholar
Moore FC, Lobell DB (2015) The fingerprint of climate trends on European crop yields. Proc Natl Acad Sci USA 112(9):2670–2675. https://doi.org/10.1073/pnas.1409606112
Article CAS Google Scholar
Nemoto M, Hamasaki T, Matsuba S et al (2016) Estimation of rice yield components with meteorological elements divided according to developmental stages. J Agric Meteorol 72(3–4):128–141. https://doi.org/10.2480/agrmet.D-15-00017
Article Google Scholar
Racca P, Kakau J, Kleinhenz B, Kuhn C (2015) Impact of climate change on the phenological development of winter wheat, sugar beet and winter oilseed rape in lower saxony. Germany J Plant Dis Prot 122(1):16–27. https://doi.org/10.1007/BF03356526
Article Google Scholar
Salehnia N, Salehnia N, Torshizi AS et al (2020) Rainfed wheat (Triticum aestivum L.) yield prediction using economical, meteorological, and drought indicators through pooled panel data and statistical downscaling. Ecol Indic 111:105991. https://doi.org/10.1016/j.ecolind.2019.105991
Article Google Scholar
Schauberger B, Archontoulis S, Arneth A et al (2017) Consistent negative response of US crops to high temperatures in observations and crop models. Nat Commun 8(13):931. https://doi.org/10.1038/ncomms13931
Article CAS Google Scholar
Schauberger B, Gornott C, Wechsung F (2017) Global evaluation of a semiempirical model for yield anomalies and application to within-season yield forecasting. Glob Change Biol 23(11):4750–4764. https://doi.org/10.1111/gcb.13738
Article Google Scholar
Schauberger B, Ben-Ari T, Makowski D et al (2018) Yield trends, variability and stagnation analysis of major crops in France over more than a century. Sci Rep 8(16):865. https://doi.org/10.1038/s41598-018-35351-1
Article CAS Google Scholar
Schauberger B, Jägermeyr J, Gornott C (2020) A systematic review of local to regional forecasting approaches and frequently used data resources. Eur J Agron 120(126):153. https://doi.org/10.1016/j.eja.2020.126153
Article Google Scholar
Silva ACT, Branco PTBS, Sousa SIV (2022) Impact of COVID-19 pandemic on air quality: a systematic review. Int J Environ Res Public Health 19(4):1950. https://doi.org/10.3390/ijerph19041950
Article CAS Google Scholar
Skirienė AF, Stasiškienė Ž (2021) COVID-19 and air pollution: measuring pandemic impact to air quality in five European countries. Atmosphere 12(3):290. https://doi.org/10.3390/atmos12030290
Article CAS Google Scholar
Statistische Ämter (2021a) Table 41141–01–01–4: Landwirtschaftliche betriebe und deren landwirtschaftlich genutzte fläche (lf) nach kulturarten – jahr – regionale tiefe: Kreise und krfr. städte. Downloadable data table. Regional statistical data base, Statistische Ämter des Bundes und der Länder, Stuttgart, https://www.regionalstatistik.de/genesis//online?operation=table&code=41141-01-01-4&bypass=true#abreadcrumb, revision of 2021a, Last access January 2022
Statistische Ämter (2021b) Table 41241–01–03–4: Erträge ausgewählter landwirtschaftlicher feldfrüchte – jahressumme – regionale tiefe: Kreise und krfr. städte. Downloadable data table. Regional statistical data base, Statistische Ämter des Bundes und der Länder, Stuttgart, https://www.regionalstatistik.de/genesis//online?operation=table&code=41241-01-03-4&bypass=true#abreadcrumb, revision of 2021b, Last access January 2022
Torkmahalleh MA, Akhmetvaliyeva Z, Omran AD, Omran FD, Kazemitabar M et al (2021) Global air quality and COVID-19 pandemic: do we breathe cleaner air? Aerosol Air Quality Res 21(4):200567. https://doi.org/10.4209/aaqr.200567
Article CAS Google Scholar
Trnka M, Olesen JE, Kersebaum KC et al (2016) Changing regional weather–crop yield relationships across Europe between 1901 and 2012. Clim Res 70(2–3):195–214. https://doi.org/10.3354/cr01426
Article Google Scholar
Weston S (2017) doMPI: Foreach parallel adaptor for the Rmpi package. R package version 0.2.2. https://CRAN.R-project.org/package=doMPI – canonical link to current version, in case of updates v.0.2.2 can be accessed through the archive link.
Zhang N, Zhao C, Quiring SM et al (2017) Winter wheat yield prediction using normalized difference vegetative index and agro-climatic parameters in Oklahoma. Agron J 109(6):2700–2713. https://doi.org/10.2134/agronj2017.03.0133
Article Google Scholar
Zhang L, Zhang Z, Luo Y et al (2020) Combining optical, fluorescence, thermal satellite, and environmental data to predict county-level maize yield in China using machine learning approaches. Remote Sens 12(1):21. https://doi.org/10.3390/rs12010021
Article Google Scholar
Zhang N, Qu Y, Song Z, Chen Y, Jiang J (2022) Responses and sensitivities of maize phenology to climate change from 1971 to 2020 in Henan Province, China. Plos One 17(1):e0262289. https://doi.org/10.1371/journal.pone.0262280
Article CAS Google Scholar

Download references

Funding

Open Access funding enabled and organized by Projekt DEAL. This work was financially supported by the German Federal Office for Agriculture and Food (BLE) in the framework of the national research project OptAKlim (grant no. 281B203316) and by the German Aerospace Center (DLR) on behalf of the Federal Ministry of Education and Research (BMBF) in the frameworks of the international BiodivERsA research project SALBES (grant no. 01LC1809B) and the international AXIS research project CROSSDRO (grant no. 01LS1901A).

Author information

Authors and Affiliations

Potsdam Institute for Climate Impact Research, Telegrafenberg A31, 14473, Potsdam, Germany
Tobias Conradt

Authors

Tobias Conradt
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tobias Conradt.

Ethics declarations

Ethics approval

Not applicable, this study did not include any research on humans or animals.

Competing interests

The author declares no competing interests.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 5028 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Conradt, T. Choosing multiple linear regressions for weather-based crop yield prediction with ABSOLUT v1.2 applied to the districts of Germany. Int J Biometeorol 66, 2287–2300 (2022). https://doi.org/10.1007/s00484-022-02356-5

Download citation

Received: 02 March 2022
Revised: 17 August 2022
Accepted: 22 August 2022
Published: 03 September 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s00484-022-02356-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Choosing multiple linear regressions for weather-based crop yield prediction with ABSOLUT v1.2 applied to the districts of Germany

Abstract

Similar content being viewed by others

LiDAR Data Fusion to Improve Forest Attribute Estimates: A Review

Enhancing crop recommendation systems with explainable artificial intelligence: a study on agricultural decision-making

Analysis of factors affecting evapotranspiration zoning

Introduction

Materials

Hard- and software

Input data

Specifics of the example application

Primary data from external sources

Methods

The five steps of ABSOLUT

Program 1: “the prospector”

Program 2: “the workhorse”

Program 3: “the gold pan”

Programs 4 and 5: “crucible and mould”

Setup of the Germany application

Results

Observations along the workflow for winter wheat

Running program 1

Running programs 2 and 3

Running programs 4 and 5

Prediction performance

Regional performance for 2018 silage maize yields

Yield predictions for Germany

Weather input of Gornott and Wechsung

Discussion

Performance in comparison to previous studies and official yield predictions

An over-confidence trap in statistical modelling

Improvement potentials and development opportunities

Conclusions

Data availability

Code availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics approval

Competing interests

Supplementary Information

Supplementary file1 (PDF 5028 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation